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PREFACE 


The  Lecture  Series  is  intended  to  provide  the  basic  theories  and  concepts  involved  in  the 
design  of  advanced  guidance  and  control  systems  employing  state-space  and  multi-variable 
design  methods.  An  intrinsic  part  of  this  Lecture  Series  will  be  computer-aided  and  graphical 
techniques  that  can  be  employed  in  preliminary  design  and  related  analysis  methods.  This 
will  provide  one  document  which  covers  the  necessary  design  background  and  state-of-the-art 
involved  in  the  application  of  advancing  technologies. 

Among  the  main  topics  to  be  reviewed  are: 

Analysis  and  Synthesis  Techniques 
Application  of  Observer  and  Estimation  Principles 
Computer-Aided  Design  and  Analysis  Methods 
System  Simulation  Techniques 
Tests  Evaluation  and  Validation 

There  will  be  a  round-table  discussion  at  the  end  of  the  presentations  during  which 
comments  and  suggestions  will  be  invited  from  participants. 


Dr  R.E.POPE 
Lecture  Series  Director 
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The  Need  for  Multivariable 
Design  and  Analysis  Techniques 


Rhall  E.  Pope 
Honeywell  Inc. 

Systems  and  Research  Center 
2600  Ridgway  Parkway  P.0.  Box  312 
Minneapolis,  Minnesota  USA  55440 


Summary 

Because  of  escalating  performance  demands,  modern  day  warfare  systems  are  becoming 
increasingly  complex.  Demands  for  higher  accuracy,  improved  reliability/survivability, 
and  more  automation  are  placing  increased  emphasis  on  the  control  function  for 
successful  operation.  Nowhere  is  this  increased  emphasis  more  evident  than  in  the 
control  functions  required  in  todays  and  future  aircraft  weapon  systems. 

Modifications  to  airframe  designs  directed  at  increased  maneuverability  and  reduced 
weight  are  placing  stringent  demands  on  flight  control  systems.  This  is  particularly 
true  for  advanced  fighter  aircraft  which  may  possess  relaxed  static  stability, 
additional  surfaces  for  direct  force  control,  and  active  structural  control 
requirements.  Fly-by-wire  systems,  particularly  digital  systems,  have  provided  the 
flexibility  to  not  only  accommodate  but  influence  airframe  design  modifications  and 
have  led  to  the  control  configured  vehicle  (CCV)  concept.  These  advances  in  air 
vehicle  design  and  flight  control  system  implementation  have  begun  to  overwhelm 
traditional  design  and  analysis  techniques  which  are  most  effective  on  aircraft 
applications  with  few  surfaces,  little  dynamic  coupling,  separation  between  flight 
control  and  other  avionic  subsystem  bandwidths  and  analog  system  implementations. 

In  addition,  stability  and  control  design  specifications  have  become  inadequate  in 
dealing  with  statically  unstable  vehicles,  multiple  control  loops  and  high  dynamic 
coupling.  Better  control  system  design  and  analysis  techniques  are  needed  which 
address  the  multi-input  closely  coupled  dynamic  nature  of  today's  and  tomorrows 
aircraft  weapon  systems. 


1.0  Introduction 

A  multivariable  flight  control  system  is  one  in  which  there  are  multiple 
interacting  control  loops.  This  interaction  is  dictated  by  the  dynamic  coupling 
resulting  from  the  aircraft  and  propulsion  system  design.  Until  recently,  aircraft 
were  designed  to  minimize  dynamic  coupling.  Propulsion  systems  were  designed  based 
primarily  on  forward  velocity  requirements. 

For  the  design  of  flight  control  systems  with  little  dynamic  coupling,  traditional 
or  so  called  classical  design  and  analysis  techniques  are  adequate  and  provide  key 
insights  into  the  fundamental  design  issues  of  feedback  control  systems.  The  design 
and  analysis  techniques  discussed  in  subsequent  chapters  are  directed  at  systems  whose 
control  loop  interaction  extend  the  utility  of  classical  techniques  to  the  point  where 

they  are  not  only  cumbersome  to  use  as  design  tools  but  produce  flight  control  system 

designs  with  undesirable  performance  characteristics.  The  inadequacies  of  classical 
design  techniques  are  by  no  means  accepted  facts.  There  has  been  continuous  debate 
over  the  last  two  decades  as  to  the  utility  of  classical  techniques  versus  the  utility 
of  non-classical  or  modern  techniques. 

It  is  useful  to  view  the  utility  issue  from  the  perspective  of  the  flight  control 
system  application,  particularly  as  flight  control  systems  have  evolved  over  the 
years.  In  depicting  that  history,  the  essential  Items  to  consider  are 

•  the  aircraft  application 

•  the  performance  requirements 

•  the  control  approach 

These  items  determine  the  utility  of  the  design  and  analysis  techniques. 

The  first  row  of  Table  1  represents  the  aircraft  application,  performance 
requirements,  control  approach,  and  design  and  analysis  techniques  for  early  aircraft 
up  through  aircraft  built  in  the  1950's.  For  these  applications  the  airframe  was 
designed  to  provide  stability  and  control  for  the  three  attitude  degrees  of  freedom  and 
the  propulsion  system  was  designed  for  speed  control.  The  control  approach  was  open 
loop  and  design  and  analysis  technques  were  airframe  oriented.  Feedback  control  design 
techniques  representative  of  classical  techniques  are  shown  for  systems  described  by 
row  2  of  Table  2.  In  this  case  also,  the  airframe  was  designed  to  provide  3  DOF 
attitude  stability  and  control  and  the  propulsion  system  provide  speed  control. 
Inadequate  airframe  designs  or  the  promise  of  improved  performance  resulted  in  feedback 
systems  which  were  used  to  augment  stability.  The  most  prevalent  example  of  such  a 
stability  augmentation  system  Is  a  yaw  damper.  The  introduction  of  feedback  control 
required  additional  design  and  analysis  techniques,  particularly  those  which  addressed 
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Table  1,  Aircraft  Control  Systems  Early  To 
Current  Operational  Systems 

the  stability  characteristics  of  feedback  control  systems.  Because  the  feedback 
control  design  was  very  simple,  involving  only  one  sensor  variable  and  one  surface 
command,  classical  techniques  were  very  effective  and  led  to  the  acceptance  of 
stability  margins  as  flight  control  system  design  specifications. 

Additional  demands  were  then  placed  on  flight  control  systems  in  the  form  of 
command  augmentation  systems  as  shown  in  row  3  of  Table  1.  The  airframe  application 
still  remained  the  same  with  or  without  a  need  for  stability  augmentation.  Handling 
quality  tests  determined  that  command  augmentation  provided  better  handling  qualities 
as  exemplified  by  the  acceptance  of  rate  command  systems  or  C»  systems.  Traditional 
design  techniques  were  still  adequate  for  design  since  despite  an  increase  in  the 
number  of  sensor  and  surface  pairs,  the  design  could  be  performed  one  pair  at  a  time 
because  of  the  loose  dynamic  coupling. 

The  introduction  of  command  augmentation,  however,  initiated  the  application  of 
modern  multivariable  techniques,  particularly  model  following  approaches.  These 
techniques  promised  to  facilitate  flight  control  system  design  thus  producing  better 
designs.  Despite  the  promise,  they  were  not  widely  accepted  by  practical  control 
system  designers. 

Fly-by-wire  systems,  particularly  digital  FBW  systems,  as  shown  in  row  4  of  Table 
1,  brought  new  issues  to  flight  control  design.  New  techniques  were  developed  and 
utilized  to  insure  that  digital  systems  performed  as  closely  as  possible  to  their 
analog  counterparts.  In  addition  the  availability  of  a  digital  computer  and  its 
associated  "unlimited"  computational  capability  on  board  the  aircraft  encouraged  more 
application  of  modern  techniques  which  promised  better  performance.  Again  these 
techniques  were  not  widely  accepted  by  practical  control  designers. 

The  systems  described  by  row  A  of  Table  1  represent  the  state  of  the  art  of  todays 
production  aircraft.  Table  2  presents  characterizations  for  current  experimental  and 
prototype  aircraft  and  projected  production  systems.  The  introduction  of  the  control 
configured  vehicle  (CCV)  concept  has  had  a  dramatic  effect  on  flight  control  systems. 
In  a  CCV  aircraft,  the  flight  control  system  is  not  merely  augmenting  stability  or 
improving  performance,  but  is  providing  a  flight  critical  stabilizing  and  control 
function.  The  criticality  of  the  flight  control  system  in  a  CCV  application  has 
intensified  the  need  for  efficient  and  reliable  design  and  analyses  techniques.  CCV 
aircraft,  in  themselves  however,  do  not  possess  dynamic  coupling  levels  which  make 
classical  design  techniques  intractable.  In  addition,  classical  techniques  directly 
address  stability  issues  and  have  therefore  been  much  more  attractive  to  a  designer  for 
CCV  control  design. 

Direct  force  control,  made  possible  by  additional  surfaces  and  thrust  vectoring  of 
the  propulsion  system,  as  characterized  by  row  2  of  Table  2  introduced  a  flight  control 
design  application  that  benefits  from  multivariable  control  techniques.  In  this 
application,  it  is  difficult  to  eliminate  closely  coupled  dynamics  in  the  airframe 
design.  Interaction  with  the  propulsion  system  can  magnify  the  coupling.  For  this 
application,  the  large  number  of  control  inputs  and  the  close  coupling  of  dynamics  can 
easily  overwhelm  classical  "one  loop  at  a  time"  techniques. 
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Table  2.  Aircraft  Control  System  Early  To 
Current  Operational  Systems 

The  problem  Is  projected  to  worsen  with  requirements  to  "integrate"  flight  control 
and  other  avionics  subsystems  as  shown  in  Row  3  of  Table  2  and  conceptually  in  Figure 
1.  This  integration  will  dramatically  increase  the  number  of  inputs  the  control  system 
must  handle  and  introduces  the  potential  for  interaction  among  loops  never  before 
encountered  in  operational  systems. 


In  the  sections  that  follow,  we  will  address  the  need  for  multivariable  design  and 
analysis  techniques.  In  Section  2.0,  a  statement  of  the  multivariable  control  problem 
will  be  presented.  Section  3.0  will  discuss  the  requirements  of  multivariable  design 
and  analysis  techniques  for  successfully  addressing  the  multivariable  control  problem. 
Section  4 . 0  will  present  two  examples  which  illustrate  the  need  for  better  techniques. 
Summary  and  conclusions  will  be  presented  in  Section  5.0. 

2.0  The  Multivariable  Control  Problem 

The  multivariable  control  problem  is  inherently  a  feedback  control  problem.  As 
such,  there  are  two  key  aspects  which  will  determine  the  utility  of  feedback  control 
design  and  analysis  approaches: 

•  the  control  application  and  its  reoresentation 

•  the  control  design  criteria 

The  elements  of  each  of  these  aspects  will  now  be  discussed. 

2.1  The  control  application  and  its  representations 

Before  beginning  any  control  system  design  and  analysis,  it  is  critical  to  have  a 
thorough  understanding  of  the  nature  and  requirements  of  the  control  application.  The 
understanding  of  the  application  can  assume  different  forms  and  can  be  obtained  from  a 
variety  of  sources.  The  basic  sources  are 

•  observations  of  the  behavior  of  the  application  to  be  controlled  or  a  similar 
application 

•  an  analytic  understanding  of  the  fundamental  behavior  of  an  application 
deduced  from  laws  of  physics,  chemistry,  etc. 

The  information  from  one  or  both  of  these  sources  can  be  assembled  in  a  form  that 
can  be  used  to  predict  how  the  application  will  behave.  These  forms  can  vary  in 
sophistication  from  "gut  feels",  to  laboratory  test  beds,  to  pilot  olant  or  prototypes 
to  complex  mathematical  "epresentat ions .  All  of  these  assemblages  of  information  or 
information  generators  can  loosely  be  termed  "models."  An  essential  point  to  remember, 
however,  is  that  the  model,  whatever  the  form  is  merely  a  device  to  aide  in 
understanding  the  behavior  of  the  application.  The  level  of  understanding  and  the 
associated  fidelity  of  the  model  differ  with  the  nature  and  the  requirements  of  the 
application.  Over  the  last  twenty-five  years,  the  models  used  in  the  aerospace  control 
community  have  been  much  more  mathematically  oriented  than  in  other  communities,  e.g., 
process  control.  The  primary  reason  for  this  is  the  lack  of  a  safe,  cost  effective  way 
to  gather  information  on  operational  systems  that  can  then  be  used  to  predict  system 
behavior.  The  Space  Shuttle  is  a  prime  example  of  the  difficulty.  Since  the  Shuttle 
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Figure  1.  Integrated  System  Representation 
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design .  For  example,  in 
effort  should  be  on  major  control  design  drivers  leading  to  a  simple  but  accurate, 
within  limits,  model  upon  which  numerous  control  desi 
performed.  At  the  acceptance  stage,  of  course,  the  mode 
fidelity  with  less  tradeoff  analysis  expected. 

Modeling  approximations  should  be  clearly  recognized, 
approximation  to  an  actual  system .  Every  attempt  should  Be  made  to  recognize  the 
impact  of  the  approximations  made, 
for  flight  control  system  design  are 

•  linearization  of  non  linear  dynamics  about  selected  operating  points 

•  neglect  of  some  non  linearities  (e.g.,  control  saturation) 

•  neglect  of  high  freguency  dynamics. 

Ti.ese  approximations  are  well  understood  and  in  general  their  impact  can  be 

guantified.  The  number  of  approximations  are  reduced  as  the  design  proceeds 
through  each  development  phase. 

Modeling  uncertainties  should  be  recognized.  A  model  uncertainty  is  different  than 
a  modeling  approximation.  Modeling  approximations  refer  to  those  actions  knowingly 
taken  by  a  control  designer  to  expedite  the  control  design  task, 
linearizing  a  non- linear  model  about  an  operating  point  is 

approximation.  Many  times,  however,  the  best  mathematical  models  that  can 

constructed  to  represent  the  behavior  of  a  system  still  contain  a 
uncertainty.  The  uncertainty  may  arise  from  the  lack  of  similar 

could  be  used  for  behavioral  comparisons  or  from  deficiencies  in  the 
of  the  physical  or  chemical  phenomena  involved.  As  discussed  in  Section  4.2,  the 

models  used  for  the  Space  Shuttle  Flight  Control  design  had  a  great  deal  of  model 

uncertainty.  To  deal  with  these  uncertainties,  the  potential  variations  in 

uncertain  modeling  variables  should  be  specified. 

The  conseguerices  of  modeling  approximations  and  uncertainties  should  be 
recognized .  Whether  a  model  of  system  behavior  is  approximated  or  whether  there  is 
uncertainty  in  the  model,  the  conseguences  of  the  model  limitations  as  they  impact 
control  system  design  must  be  recognized.  For  example,  truncating  high  freguency 
structural  dynamic  modes  at  a  particular  freguency  is  acceptable  if  it  is 

recognized  that  the  control  system  must  also  attenuate  these  high  freguency 

dynamics  above  that  freguency  or  be  insensitive  to  them. 

The  model  should  cuver  the  operating  range  of  the  application.  This  reguires  that 
the  design  points  be  chosen  so  as  to  uncover  and  exercise  all  of  the  control  system 
design  drivers.  The  number  of  design  points  chosen  should  be  sufficient  such  that 
the  assemblage  of  the  operating  ranges  of  the  control  systems  at  each  design  point 
cover  the  total  operating  range. 
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Since  control  system  design  and  analysis  technigues  generally  begin  with  an 
understanding  of  the  control  problem,  which  is  represented  in  model  form,  the  model 
form  and  characteristics  can  become  an  intimate  part  of  design  and  analysis  techniques. 


1-5 


2.2  The  Control  Design  Criteria 

Design  criteria  for  feedback  control  systems  can  be  lumped  into  three  broad 
categories:  Performance,  Stability,  and  Sensitivity.  In  order  to  set  the  stage  for 
defining  requirements  for  multivariable  design  and  analysis  techniques,  a  brief  review 
of  these  criteria  is  appropriate. 

Performance ■  Performance  criteria  are  generally  specified  as  a  requirement  for  the 
control  system  to  follow  commands  or  reject  disturbances.  An  example  of  a  command 
following  specification  is  shown  in  Figure  2.  This  Lype  of  performance  criteria  which 
is  used  in  the  Space  Shuttle  program  requires  that  the  vehicle  response  (in  the  case 
shown,  roll  rate)  to  a  step  input  (full  roll  stick  command)  must  fall  within  a 
specified  envelope.  Disturbance  rejection  criteria  can  take  a  number  of  forms. 
Disturbances  are  considered  to  be  externally  applied,  as  in  the  case  of  turbulence.  A 
typical  criteria  for  turbulence  rejection  would  be  to  not  exceed  a  specified  vehicle 
acceleration  level  during  turbulence. 


TIME 


Figure  2.  Space  Shuttle  Entry  Step  Command  Following  Response  Specifications 

Stability.  The  most  commonly  used  stability  criteria  are  stability  margins. 
Stability  margins  generally  refer  to  the  amount  of  gain  or  phase  variation  from  a 
design  condition  a  system  employing  feedback  control  Can  experience  before  becoming 
unstable.  The  gain  and  phase  margin  type  of  stability  margin  formulation  is  a 
consequence  of  the  original  techniques  (i.e..  Bode,  Nyquist,  Nichols)  that  were  used  to 
analyze  feedback  control  systems.  The  source  of  the  gain  or  phase  variation,  be  it 
modeling  approximations,  model  uncertainty,  change  in  operating  conditions  or 
combinations,  influences  the  magnitude  of  the  margins  desired.  The  Space  Shuttle 
stability  margin  inertia  which  are  fairly  typical  for  flight  control  systems  are  shown 
in  Table  3.  As  will  be  discussed  in  subsequent  papers,  there  are  limitations 
associated  with  using  this  type  of  stability  criteria. 

Sensitivity .  In  early  feedback  control  applications,  reducing  sensitivity  was  the 
only  control  design  criteria,  particularly  reducing  sensitivity  to  noise.  Over  the 
last  several  years,  sensitivity  issues  have  attracted  new  attention  and  have  generated 
a  new  requirement  -  robustness.  A  control  system  is  robust  (insensitive)  if 
performance  is  maintained  in  the  presence  of  variations  in  the  plant  from  the  design 
model.  These  variations  can  be  modeling  approximations,  modeling  uncertainties, 
failures  of  control  elements  (sensors,  actuators),  noise,  non-linearities,  etc.  with 
the  exception  of  noise,  there  are  no  well  accepted  criteria  for  dealing  with  these 
sensitivity  issues  at  the  present  time. 

These  criteria  are  not  mutually  exclusive.  Good  stability  characteristics  in  well 
designed  systems  usually  are  accompanied  by  good  performance.  Sensitivity  or 
robustness  must  be  considered  relative  to  the  variations  from  the  design  model  that  the 
control  system  will  experience.  The  criteria  may  also  be  formally  or  informally 
applied.  For  example,  satisfying  the  Shuttle  specs  shown  in  Figure  2,  was  a  reqirement 
for  control  system  acceptance.  The  sensitivity  Issues,  of  which  there  ware  many  in 
Shuttle,  had  less  rigidly  applied  criteria  associated  with  them. 


Table  3.  Space  Shuttle  Entry  Stability  Margin  Specifications 
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3.0  Requirements  for  Multivariable  Design  and  Analysis  Techniques 

Multivariable  deign  and  analysis  techniques  have  no  unique  requi rements  over  those 
classical  techniques  that  have  been  successfully  applied  to  single  input-single  output 
( S I  SO )  feedback  control  system  designs.  It  may  be  instructive,  however,  to  review  the 
characteristics  of  SISO  techniques  that  contributed  to  their  utility  and  widespread 
acceptance.  SISO  oriented  techniques  have  the  following  qualities: 

•  Provide  insight  into  key  feedback  control  issues 

•  Easy  to  use  with  experience 

•  Detailed  models  not  required 

Techniques  with  these  qualities  produced  control  systems  which  would  satisfy 
performance  specifications  with  a  hiqh  degree  of  confidence  and  with  very  little  design 
effort.  Probably  the  most  important  for  these  qualities  is  the  amount  of  insight  the 
technique  provides  of  understanding  control  system  drivers.  If  the  technique  directly 
addresses  a  design  criteria,  then  it  is  much  easier  and  quicker  to  make  intelligent 
design  decisions.  Techniques  that  do  not  provide  such  insights  can  be  frustrating  in 
their  lack  of  guidance  as  to  what  can  be  done  to  improve  the  design.  As  we  stated 
earlier,  control  applications  have  become  more  complex.  Because  of  this,  the  same 
characteristics  that  provided  utility  for  designing  and  analyzing  SISO  systems  become 
less  useful  for  multivariable  systems.  In  fact  they  become  difficult  to  use  and  can  be 
misleading  in  their  analysis  of  key  feedback  control  issues.  It  is  important  to 
understand,  however,  why  they  were  originally  so  attractive,  because  the  same  qualities 
should  be  striven  for  in  multivariable  design  and  analysis  technques. 

A . 0  Illustrative  Examples 

Two  examples  illustrate  the  need  for  multivariable  design  and  analysis  techniques. 
The  first  example  illustrates  the  multi-input  design  and  analysis  problem  associated 
with  an  integrated  flight  and  propulsion  control  system.  The  second  example  deals  with 
the  inadequacies  of  current  techniques  for  measuring  the  stability  characteristics  of 
systems  with  interacting  control  loop. 

4.1  Integrated  Flight  and  Propulsion  Control 

Air  warfare  systems  are  becoming  increasingly  complex.  Figure  3  presents 
dramatically  the  increase  in  control  variables  that  will  accompany  integration  of  air 

warfare  functions  in  future  systems.  The  most  significant  incremental  increase  in 

control  variables  will  occur  with  the  integration  of  the  flight  control  system  and  the 
propulsion  control  system  (engine,  inlet,  nozzle).  This  integration  promises  to 
improve  aircraft  maneuvering  capability,  reduce  fuel  expenditure,  and  extend  engine 

lifetimes.  The  control  modes  that  have  been  proposed  to  achieve  the  advantages  of 
coupling  the  propulsion  and  flight  control  system  are 

•  Energy  Management 

•  40  Navigation 

•  Minimum  Drag 

•  Maximum  Thrust 

•  Maneuver  Enhancement 

•  Automatic  Envelope  Limiting 

The  fundamental  issue  in  integrating  the  flight  and  propulsion  control  systems  is 

the  strong  interactions  that  may  be  expected  between  the  engines,  inlets,  and  airframe 
in  high  performance  aircraft,  particularly  in  the  supersonic  flight  regime.  Such 
interaction  has  been  demonstrated  in  the  YF-12  and  XB-70  aircraft  which  experience 
larger  pitch,  roll  and  yaw  moments  due  to  airframe/inlet  interaction.  In  the  YF-12, 
for  example,  the  rolling  moment  produced  by  the  inlet  bypass  doors  is  equivalent  to 
that  produced  by  the  primary  rolling  momment  generators,  the  ailerions.  The  typical 
maneuvering  of  higher  performance  fighters  will  subject  the  inlet  to  large  variations 
in  local  flow  angle,  local  mach  number  and  pressure  all  of  which  will  impact  engine 
per  formance . 

In  addition  to  the  standard  throttle  demands  placed  by  the  airframe  on  the  engine, 
fuel  consumption  will  alter  the  airframe  center  of  gravity  requiring  retrim.  These 
problems  are  insignificant,  however,  when  placed  in  comparison  to  vectored  thrust 

aircraft  like  the  Harrier  and  AV-8.  Designers  forced  with  integrating  complex  highly 

interacting  engine  inlet  and  flight  control  requirements  have  a  formidable  task. 

In  today's  turbofan  engines,  e.g.,  Pratt  and  Whitney  (P&W)  F-100,  the  fan  and 

compressor  margins  are  defined  to  accommodate  large  vehicle  maneuvers.  In  cruise 
situations  improvements  in  engine  efficiency  cculd  be  realized  if  the  operating  points 
were  moved  (i.e.,  uptrimmed).  The  operating  point  could  then  be  moved  prior  to  the 
onset  of  large  maneuvers  to  maintain  margins.  light  control  of  the  engines'  operating 
points  and  margins  requires  close  cooperation  amonq  the  various  subsystems.  This 

coordination  requires  the  use  of  multivariable  control  strategies. 

These  interactive  propulsion  and  flight  control  modes  listed  above  will  require 
both  inner  and  outer  loop  design  to  fully  realize  the  advantages.  Outer  loop 

trajectory  commands  or  optimum  set  points  can  be  generated  using  optimization 

techniques.  The  multivariable  control  problem  becomes  one  of  shaping  command  responses 
tn  those  outer  loop  commands  in  a  confidently  stable  control  configuration.  The  type 
of  control  and  measurements  that  are  available  for  the  problem  are  shown  in  Figure  4. 


Figure  4 


Control  and  Measurements  for  Integrated  Flight  and  Propulsion  Control 
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The  primary  design  issues  associated  with  the  integrated  flight  and  propulsion  system 
problem  reside  in  the  following  areas: 

Design  Criteria  -  Because  flight  and  propulsion  systems  have  traditionally  been 
designed,  and  operated  relatively  independently,  no  design  criteria  exist  for  how  they 
should  be  integrated. 

Design  Models  -  Again  because  of  the  automony  of  the  sytems,  interactive  effects  are 
generally  only  crudely  approximated.  In  addition,  the  engine  control  problem  is 
extremely  non  linear.  The  flight  conditions,  of  interest,  e.g.,  high  angle  of  attack 
maneuvering,  further  complicate  the  non  linear  problem. 

failure  Sensitivity  -  Because  flight  and  propulsion  are  more  integrated,  a  failure  in  a 
sensor  or  a  controller  in  either  system  now  has  a  more  significant  impact  on  the  total 
system.  Design  and  verification  of  integrated  flight  and  propulsion  control  systems 
that  tolerate  failures  is  a  major  challenge. 

Past  Air  Force  and  NASA  programs  such  as  the  F-lll  Integrated  Propulsion  Control 
System  (1PCS),  the  YF-12  Cooperative  Control  (CAPCS)  and  Various  Flight  Propulsion  and 
Control  Coupling  studies  have  examined  some  of  the  issues.  The  NASA  integrated 
Research  Aircraft  Control  Technology  (INTERACT)  currently  is  taking  a  new  look  in  an 
attempt  to  exploit  some  of  the  new  results  in  multivariable  control. 


A. 2  The  Space  Shuttle  Flight  Control  System 

Control  of  the  Space  Shuttle  Orbiter  during  the  entry  flight  phase  is  the  most 
demanding  flight  control  problem  control  designers  have  ever  faced.  The  major 
contributors  to  the  problem  are  a  statically  unstable  vehicle,  a  vast  operating 
envelope,  a  large  degree  of  uncertainty  in  modeling  the  vehicle  behavior,  and  a  control 
system  comprised  of  a  combination  of  linear  (aerodynamic  surfaces)  and  non-linear 
(reaction  control  jets)  devices. 


The  large  operating  envelope  is  shown  in  Figure  5.  The  orbiter  control  system  must 
operate  over  velocities  from  26,000  FPS  to  0  FPS,  dynamic  pressures  of  0  PSF  to  A00 
P5F,  and  angles  of  attack  from  0°  to  50°,  all  of  which  directly  determine  vehicle 
dynamic  response.  At  the  low  dynamic  pressure  portion  of  the  flight,  only  reaction 
control  jets  are  employed.  However,  during  the  majority  of  the  entry,  reaction  jets 
are  used  in  combination  with  the  aerodynamics  surfaces  for  lateral  directional  control. 


The  problem  that  most  plagued  shuttle  control  design  was  uncertainty  in  modeling 
the  vehicles  projected  behavior  coupled  with  potential  large  deviations  from  a  nominal 
trajectory.  The  modeling  uncertainties  were  related  to  aerodynamics,  structural 
flexibility,  and  flow  interaction  with  reaction  jets.  The  deviations  from  nominal 
trajectory  were  produced  by  atmospheric  conditions  (e.g.,  winds)  and  potential 
variations  in  vehicle  drag.  These  deviations  resulted  in  large  variations  in  dynamic 
pressure  which  is  a  key  flight  control  design  element. 


Figure  5.  Orbit  Entry  Parameters 
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The  approach  taken  to  determine  the  Impact  of  uncertainty  involved  a  number  of 
steps.  First,  statistical  bounds  were  placed  on  all  uncertain  parameters.  Statistical 
bounds  were  also  placed  on  potential  deviations  from  the  nominal  trajectory.  Second, 
design  criteria  were  established  which  required  that  the  control  system  operate  over 
the  statistically  defined  flight  envelope  which  included  not  only  deviations  from  a 
nominal  trajectory  but  variations  in  vehicles  aerodynamics  and  reaction  jet 
characteristics.  Third,  the  impact  of  these  statistical  uncertainties  was  then 
analyzed  with  respect  to  the  command  response  and  stability  margin  criteria  presented 
in  Section  2.0.  The  analysis  approaches  employed  were 

•  linear  transient  response  and  stability  analysis 

•  primarily  linear  with  selected  non  linearities  transient  response  analysis 

•  6  DOF  non  linear  transient  response  analysis  at  fixed  flight  conditions 

•  6  OOF  non  linear  transient  response  analysis  over  entry  trajectories 

•  full  up  simulations 

-  automatic 

-  man  in  the  loop 

As  demonstrated  by  the  successful  first  flight,  this  exhaustive  approach  did  produce  a 
successful  control  design. 

Because  of  the  uncertainty  issue,  the  Shuttle  is  a  good  example  to  illustrate  the 
inadequacies  that  currently  exist  in  analyzing  the  stability  characteristics  of  systems 
with  a  large  degree  of  uncertainty.  The  first  inadequacy  was  the  lack  of  analysis 
tools  that  gave  the  same  idea  of  a  "stability  margin"  for  analyzing  the  effect  of  the 
statistically  represented  uncertainties  described  above.  Because  these  statistical 
uncertainties  appeared  in  only  parts  of  the  model  and  in  different  combinations, 

standard  gain  and  phase  margin  analysis  techniques  did  not  apply.  Analysis  required 
exhaustive,  time  consuming,  costly  simulations  to  investigate  all  the  possible 

combinations. 

A  technique  which  would  map  an  envelope  of  uncertainty  to  a  stability  margin 

representation  would  have  been  extremely  useful  on  the  Shuttle  control  system  design. 
An  example  is  shown  in  Figure  6  where  a  two  parameter  (e.g.,  C^g  vs  C|_g  ) 

envelope  of  uncertainty  is  mapped  to  a  representation  of  closed  loop  eignevlues,  for 

both  open  and  closed  loop  behavior.  A  stability  margin  type  measure  might  be  the  curve 
Pj '  p£ '  which  represents  the  stability  boundaries  on  the  allowable  parameter 
variations.  This  of  course,  only  represents  a  two  parameter  case.  For  Shuttle  control 
system  analysis,  the  effect  of  ten  to  fifteen  parameters  and  all  the  possible 
combinations  had  to  be  determined. 

Another  area  of  concern  is  the  inadequacy  of  current  stability  analysis  techniques 
to  address  the  direction  of  an  uncertainty.  Classical  techniques  basically  only 

address  magnitude  variations  in  one  direction.  Preliminary  analysis  of  the  first 

flight  Shuttle  data  indicates  that  the  vehicle  roll  response  at  low  dynamic  pressure  to 

a  yaw  jet  firing  was  adverse  rather  than  proverse  as  was  expected.  If  this  effect  is 
taken  into  account,  the  stability  margins  deteriorate  from  satisfying  specs  to  a  -1  dB 
low  frequency  margin. 

These  two  examples  indicate  a  need  for  new  techniques  for  designing  and  analyzing 
control  systems  with  a  large  number  of  interacting  control  loops  and  levels  of 
uncertainty  in  predicting  system  behavior. 


P-PLANE  S-PLANE 
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Figure  6.  Uncertainty  Mapping  for  Stability  Analysis 


5.0  Summary  and  Conclusions 

A  case  for  the  need  for  new  multivariable  design  and  analysis  techniques  has  been 
presented.  Modern  air  warfare  systems  demand  these  techniques  in  order  to  satisfy 
increasingly  stringent  and  complex  performance  requirements.  As  we  progress  into  the 
space  environment,  the  control  of  large  flexible  space  structures  will  pose  even  more 
severe  design  challenges  particularly  in  the  area  of  model  testing  and  uncertainty. 
Our  current  established  techniques  are  becoming  over-taxed  and  are  forcing  us  to  rely 
more  heavily  on  costly  and  time  consuming  simulation  for  design.  Computer  aided  design 
approaches  are  required  to  assimilate  and  display  the  information  required  for  control 
system  design  that  are  easy  to  use  and  provide  insight  into  key  control  design  issues. 

As  exemplified  by  the  existence  of  the  lecture  series,  there  has  been  considerable 
interest  over  the  last  twenty  years  in  developing  new  design  techniques  that  address 
these  needs.  In  the  papers  that  follow,  a  broad  spectrum  of  techniques  will  be 
described  that  differ  widely  in  design  philosophy  and  approach.  It  is  our  hope  that 
the  descriptions  will  provide  the  essential  information  for  understanding  the 
advantages  of  one  technique  over  another  for  particular  design  problems. 

As  is  the  case  in  the  use  of  any  tool,  there  is  generally  no  one  tool  which  will 
solve  all  problems  with  optimal  efficiency.  The  most  efficient  tool  for  a  particular 
application  is  the  one  that  has  been  tailored  specifically  for  that  application.  At 
the  conclusion  of  this  lecture  series,  we  hope  to  have  provided  to  you  a  tool  box  from 
which  you  can  select  the  appropriate  tool  for  your  control  design  application. 
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CHARACTERISTIC  AND  PRINCIPAL  GAINS 
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SUMMARY 

The  key  to  the  generalisation  of  classical  frequency-response  design  techniques  to 
the  multivariable  case  lies  in  the  development  of  ways  of  suitably  extending  the  concepts 
of  gain  and  phase.  It  is  shown  how  algebraic  function  theory  can  be  used  to  generalise 
Nyquist  diagram  and  Root-Locus  diagram  techniques  for  use  with  systems  having  many  in¬ 
puts  and  outputs.  This  is  done  in  such  a  way  that  the  main  structural  features  of  such 
diagrams  can  be  related  to  state-space  model  parameters.  The  shortcomings  of  character¬ 
istic  gains  and  phases  (used  in  a  generalised  Nyquist  approach)  are  then  noted.  An 
alternative  way  of  introducing  amplitude  and  phase  information,  via  the  polar  decomposi¬ 
tion  of  an  operator,  leads  to  the  introduction  of  principal  gains  and  principal  phases. 
Their  properties  are  examined  and,  in  particular,  it  is  shown  how  they  may  be  used  to 
characterise  robustness. 

Finally  a  discussion,  with  an  example,  is  given  of  the  use  of  these  techniques  for 
design  purposes. 


1.  INTRODUCTION 

In  developing  a  design  technique,  one  seeks  to  give  practising  and  experienced 
engineers  a  set  of  manipulative  and  interpretative  tools  which  will  enable  them  to  build 
up,  modify  and  assess  a  design  put  together  on  the  basis  of  physical  reasoning  within 
the  guidelines  laid  down  by  their  engineering  experience.  Thus,  in  the  development  of 
design  techniques,  consideration  of  the  way  in  which  a  designer  interacts  with  a  computer 
is  vitally  important.  It  is  imperative  to  share  the  burden  of  work  between  computer  and 
designer  in  such  a  way  that  each  does  what  they  can  do  best.  The  development  of 
techniques  which  lead  to  a  fruitful  and  effective  symbiotic  relationship  between  computer 
and  designer  is  of  great  importance  in  the  future  development  of  control  engineering 
practice  since  it  is  the  only  conceivable  way  in  which  : 

(1)  the  designer  can  deploy  his  intuitive  skill  and  experience  while  still  making 
an  effective  use  of  advanced  theoretical  tools;  and 

(ii)  the  immense  and  ever-increasing  computational  power  of  computers  can  be 
harnessed  in  a  design  context. 

A  developer  of  new  engineering  design  techniques  has  two  harsh,  and  quite  different, 
criteria  to  satisfy  : 

(i)  he  must  provide  tools  which  are  acceptable  to  practising  engineers  and  which 
are  demonstrably  useful  and  effective; 

(ii)  he  must  show  that  his  techniques  are  based  on  rigorous  and  indisputably 
correct  mathematical  foundations. 

To  be  effective  for  an  engineering  designer,  a  design  technique  must  make  the  maximum 
use  of  his  intuition  and  experience.  The  need  to  provide  him  with  ar.  intuitively 
appealing  medium  for  communication  and  visualisation,  together  with  recent  developments 
in  computer  terminals,  places  a  high  premium  on  techniques  using  graphics  and  those 
which  are  based  on  geometrical  concepts.  What  we  are  concerned  with  here  is  the 
development  of  an  appropriate  conceptual  framework  within  which  a  designer  can  work. 

This  is  based  on  fairly  straightforward  concepts  in  algebraic  geometry  (the  theory  of 
curves)  and  linear  geometry.  It  is  shown  how  the  generalisation  of  the  classical 
frequency-response  tools  of  Nyquist  and  Root-Locus  diagrams  are  algebraic  curves  derived 
from  the  spectral  analysis  of  appropriate  matrix-valued  functions  of  a  complex  variable. 
Many  of  their  key  properties  can  be  comprehensively  analysed  in  terms  of  a  state-space 
model  of  the  vector  feedback  loop  which  is  being  studied.  This  analysis  Involves  simple 
linear  geometry  and  is  essentially  concerned  with  the  way  in  which  certain  subspaces, 
defined  via  the  various  operators  Involved,  sit  with  respect  to  one  another,  and  with 
ways  of  assigning  complex  frequencies  to  those  subspaces.  Finally,  the  techniques  out¬ 
lined  are  illustrated  by  an  example  of  their  use  in  design. 


2. 


GENERALISED  NYQUIST  AND  ROOT  LOCUS  DIAGRAMS:  CHARACTERISTIC  GAINS  AND 
CHARACTERISTIC  PHASES. 


The  original  Nyquist  stability  criterion  owed  its  central  importance  in  classical 
feedback  control  work  to  the  fact  that  it  tested  the  stability  of  a  proposed  feedback 
loop  in  terms  of  a  directly  measured  transmission  characteristic  [11.  This  radical 
departure  of  inferring  stability  from  measured  signal  characteristics  instead  of  from 
computed  dynamical-model  characteristics  placed  complex-function  methods  in  a  centrally- 
important  position  in  the  development  of  control  analysis  and  design  techniques  up  to 
the  late  1940s.  The  root-locus  technique  developed  in  the  1950s  [1)  added  further 
weight  to  the  arguments  for  using  comp  lex-variable  approaches  in  the  analysis  and  design 
of  feedback  systems.  At  the  time  of  the  introduction  of  the  root-locus  method,  the 
close  link  with  the  Nyquist  technique  was  not  appreciated;  one  of  the  virtues  of  the 
algebraic  function  approach  used  here  to  generalise  these  techniques  to  the  multivariable 
case  121  is  that  it  shows  how  both  these  classical  techniques  are  essentially  different 
ways  of  representing  curves  derived  from  a  single  algebraic  function  relating  complex 
frequency  and  complex  gain. 

The  dual  role  of  complex  numbers  is  familiar  from  elementary  complex  number  theory: 
they  can  be  used  to  represent  both  objects  or  quantities  (e.g.  vectors  in  a  plane)  and 
operations  on  objects  (e.g.  rotation  and  stretching  of  vectors  in  a  plane).  This  dual 
role  is  important  in  classical  (i.e.  single-input  single-output)  frequency-response 
methods  where  complex  numbers  are  used  to  represent  frequencies  (complex  numbers)  and 
gains  (operations  on  complex  vectors).  The  two  classical  approaches  to  the  complex- 
variable-based  analysis  of  feedback  systems  : 

(i)  study  (open-loop)  gain  as  a  function  of  frequency  (the  Nyquist-Bode  approach); 
or 

(ii)  study  (closed-loop)  frequency  as  a  function  of  gain  (the  Evans  Root-Locus 
approach) . 

Our  first  objective  is  to  show  that  there  is  an  intimate  relationship  between  the 
generalisation  of  these  two  approaches  to  the  multivariable  (many-input  many-output) case 
and  the  standard  state-space  model 

l  :  X  =  Ax  t  Bu  (2.1) 

y  =  Cx  +  Du 

In  much  of  what  follows  the  matrix  D  will  be  taken  to  be  zero  for  simplicity  of 
exposition.  However  its  inclusion  at  this  point  gives  completeness  and  symmetry 
to  the  development  of  the  general  case.  [A  is  n*n;  B  is  n*i;  C  is  mxn;  D  is  mxt  . 

When  vector  feedback  loops  are  being  discussed  1=  m]  .  Suppose  we  sever  m  feedback 
connections  to  an  arbitrary  linear  dynamical  system  as  shown  in  Fig.l,  and  that  the 
transmittance  between  the  injection  point  a  and  the  return  a'  is  given  by  a  state-space 
model  of  the  form  shown  in  equation  (2.1). 
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Then,  on  remaking  the  feedback  connections,  we  see  that  the  closed- loop  characteristic 
frequencies  of  this  system  will  be  given  by  the  spectrum  (set  of  eigenvalues)  of  the 
matrix 

+  B ( I  -D) _1C 
in 
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(2.2) 


2-3 


and  that  the  open- loop  transfer  function  matrix  relating  vector  exponential  signals 
injected  at  a  to  the  vector  exponential  signals  returned  at  a!  is 

G(s)  =  D  +  C(sln-A)_1B  (2. 3j 

(I  and  I  are  unit  matrices  of  orders  m  and  n  respectively.)  If  one  compares  the 
right-hana  sides  of  (2.2)  and  (2.3)  they  are  seen  to  have  a  suggestive  structural 
similarity.  This  becomes  an  exact  structural  equivalence  if  we  introduce  a  variable  g 
and  change  (2.2)  to  the  form 

S(g)  =  A  +  B(gIm-D)_1C  (2.4) 

where  the  role  of  the  variable  g  will  emerge  shortly.  Our  aim  is  to  put,  in  a  state- 
space-model  context,  the  roles  of  open-loop  gain  (the  key  Nyquist-Bode  concept)  and 
closed- loop  frequency  (the  key  Evans  Root-Locus  concept)  on  an  exactly  equal  footing. 

Kith  this  in  view  we  look  for  a  system  giving  the  closed-loop  frequency  matrix  o^ 
equation  (2.4)  and  find  that  it  is  as  shown  in  Fig. 2,  where  g  is  now  interpreted  as  a 
gain  parameter.  On  redrawing  Fig. 2  in  the  form  shown  in  Fig. 3  we  see  that  the  variables 
s  and  g  have  indeed  been  placed  in  a  strikingly  symmetrical  relationship. 


In  Fig. 3  s  is  to  be  interpreted  as  a  complex  frequency  variable  and  g  as  a 
complex  gain  variable.  For  the  related  vector  feedback  loop  (with  m  inputs  and  m 
outputs)  the  open-loop  gain  matrix 

G(s)  =  D  +  C (sIn-A)_1B 

describes  open-loop  gain  as  a  function  of  imposed  frequency  s  ;  and  the  closed-loop 
frequency  matrix 

S (g)  =  A  +  B(gIm-D)-1C 

describes  closed- loop  frequency  as  a  function  of  the  imposed  gain  parameter  g. 
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Figure  4 
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For  the  negative  feedback  arrangement  of  Fig. 4  if 

F (s)  =  I  +  g-1G(s)  (2.5) 

m 

is  the  return-difference  matrix  for  the  set  of  feedback  loops  broken  at  a-a'  then  one  can 
show  that  [ 3  1 


dettsl  -S(g)J  detl gIm~G(s) ]  detF(s) 
dett  sIn-S  {»)  f  det[gIm-G("°)  f  detF(=°) 

_  CLCP (s) 

~  OLCP (s) 


(2.6) 


where  CLCP(s)  and  OLCP(s)  are  the  system's  closed-loop  and  open-loop  characteristic 
polynomials  respectively.  The  importance  of  equation  (2.6)  is  that  it  tells  us  that  in 
this  archetypal  negative  feedback  situation  we  may  equally  well  study  the  effect  of 
closing  feedback  loops  on  stability  in  terms  of  an  open-loop  gain  description  or  a  closed- 
loop  frequency  description.  Specifically  it  shows  that,  for  values  of  s  not  in  the 
spectrum  of  A  and  values  of  g  not  in  the  spectrum  of  D 

det  rsl  -S(g))  =  O^^detCg  Im-G(s)  )  =  0  (2.7) 

This  in  effect  says  that  a  knowledge  of  how  the  characteristic  values  of  G(s)  vary  as  a 
function  of  the  frequency  parameter  s  is  equivalent  (for  the  purposes  of  making 
inferences  about  closed-loop  stability)  to  a  knowledge  of  how  the  characteristic  values 
of  S(g)  vary  as  a  function  of  the  gain  parameter  g. 

2 . 1 _ Characteristic  frequencies  and  characteristic  gains. 

For  a  given  value  of  the  gain  parameter  g  the  eigenvalues  (s^  :  i  =  1,2,. ...n} 
of  S(g)  are  the  corresponding  set  of  closed- loop  characteristic  frequencies,  obtained 
from 


det[sln  -  S(g)  ]  =  0  (2.8) 

For  a  given  value  of  the  frequency  parameter  s  the  eigenvalues  {g^  :  i  =  l,2,...,m)  of 
G(s)  may  be  called  the  corresponding  set  of  open-loop  characteristic  gains ,  obtained  from 

det[ glm  -  G ( s ) 1=  0  (2.9) 

The  closed-loop  characteristic  frequencies  { s  ^  >  for  a  given  value  of  g  are,  as  is  well 
known,  associated  with  invariant  subspaces  in  the  state  space  3E .  The  open-loop 
characteristic  gains  { g . }  for  a  given  value  of  s  are  associated  with  invariant  sub¬ 
spaces  in  the  input  spaie  l C  (see  Fig. 3),  whose  physical  interpretation  will  be  discussed 
after  stating  the  Generalized  Nyquist  Stability  Criterion. 

From  the  characteristic  equations  for  G(s)  and  S(g),  namely 

A ( g , s)  =  det  i gl  -  G(s)  )  =  0  (2.10) 

and  . 

V (s,g)  =  det  tsln  -  S (g) ]  =  0  (2.11) 

one  obtains  a  pair  of  algebraic  equations  relating  the  complex  variables  s  and  g. 

These  define  a  pair  of  algebraic  functions  [3]  : 

(i)  a  characteristic  gain  function  g(s)  which  gives  open-loop  characteristic 
gain  as  a  function  of  frequency;  and 

(ii)  a  characteristic  frequency  function  s(g)  which  gives  a  closed-loop  character¬ 
istic  frequency  as  a  function  of  gain. 

The  importance  of  these  two  algebraic  functions  lies  in  the  fact  that  they  are  the 
natural  means  of  generalising  the  concepts  of  Nyquist  diagram  and  Root-Locus  diagram  to 
the  multivariable  case  (31. 

2.2  Generalised  Nyquist  Diagrams  and  the  Generalised  Nyquist  Stability  Criterion 

The  characteristic  gain  loci  (generalised  Nyquist  diagrams)  for  the  m-vector 
feedback  loop  with  transmittance  matrix  G(s)  are  the  loci  in  the  complex  gain  plane 
which  are  traced  out  by  the  eigenvalues  of  G(s)  as  s  traverses  the  so-called  Nyquist 
D-contour  in  the  complex  frequency  plane  (s-plane) .  They  can  also  be  defined  as  the 

-90°  constant-phase  contours  on  the  g-plane  of  the  algebraic  function  s(g).  Their 
utility  lies  in  their  role  in  the  following  sort  of  generalisation  of  the  Nyquist 
Stability  Criterion  (  3  I  ,  (4  1  . 


\ 
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Multivariable  Nyquist  Stability  Criterion  : 


The  multivariable  Nyquist  stability  criterion  relates  the 
of  the  congiguration  of  Fig. 5  to  the  characteristic  gain  loci  for 
G (s )  H  (s) 


closed-loop  stability 
the  loop-gain  matrix 


o 


Figure  5 


Suppose  the  system  of  Fig.  5  has  no  uncontrollable  and/or  unobservable  modes  whose 
corresponding  characteristic  frequencies  lie  in  the  right-half  s-plane.  Then  this 
feedback  configuration  will  be  closed-loop  stable  if  and  only  if  the  net  sum  of  anti¬ 
clockwise  encirclements  of  the  critical  point  (-1+jO)  by  the  set  of  characteristic  gain 
loci  of  G(s)H(s)  is  equal  to  the  total  number  of  right-half-plane  poles  of  G(s)  and  H(s). 

Proofs  of  multivariable  versions  of  the  Nyquist  Stability  Criterion  have  been 
given  by  Barman  and  Katzenelson  [5]  and  MacFarlane  and  Postlethwaite  [4]  ,  and  Postle- 
thwalte  and  MacFarlane  [3]  following  earlier  heuristic  approaches  by  Bohn  and  Kasvand 
[6]  and  MacFarlane  [7]  . 

2 . 3 _ Physical  Interpretation  of  Characteristic  Gains 

The  idea  of  a  characteristic  frequency  plays  a  fundamental  role  in  applied 
dynamics.  It  is  associated  with  an  invariant  subspace  in  a  dynamical  system's  state 
space.  This  invariant  subspace  is  associated  in  turn  with  a  characteristic  pattern  of 
dynamical  behaviour  (a  mode)  and  with  an  eigenvalue  of  a  matrix  mapping  the  state  space 
into  itself;  this  eigenvalue  is  the  characteristic  frequency  associated  with  the  mode. 

A  characteristic  gain  is  associated  with  an  invariant  subspace  in  the  input  space 
to  a  feedback  loop.  This  input  space  is  a  space  of  complex  vectors.  In  physical  terms 
one  can  set  up  a  correspondence  between  the  set  of  all  complex  vectors  and  the  set  of 
exponential  vector  waveforms  of  a  given  complex  frequency.  With  reference  to  datum 
values  of  amplitude  and  phase  for  a  given  reference  time,  the  components  of  a  complex 
signal  vector  then  define  the  relative  amplitudes  and  phases  of  the  various  signal  compon¬ 
ents  which  it  represents.  Now  consider  the  situation  shown  in  Fig.  6. 
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Figure  6 


If  the  loop  gain  transmittance  is  G(s)  and  if  the  injected  signal  vector  w,(s)  is  an 
eigenvector  of  G(s)  for  the  complex  frequency  s,  then  the  returned  signal  will  be 
gislWjfs)  where  g,  (s)  is  the  appropriate  characteristic  gain  of  G(s).  Hence  in  signal 
space1 this  returned  signal  vector  will  lie  in  the  same  one-dimensional  linear  manifold 
as  the  input  signal  vector.  Therefore  one  would  expect,  as  stated  in  the  Generalized 
Nyquist  Stability  Criterion,  that  characteristic  frequencies  associated  with  closing  the 
loop  through  a  unit  negative  gain  must  be  such  that  the  characteristic  gain  function 
satisfies  the  condition  ; 


g(s)  =-i . 


So  both  characteristic  frequencies  and  characteristic  gains  are  associated  with 
invariant  subspaces.  In  the  case  of  characteristic  frequency  this  is  a  linear  manifold 
along  which  state  evolves  in  a  state  space.  In  the  case  of  characteristic  gain  it  is  a 
linear  manifold  of  signal  vectors  along  which  injected  and  returned  signal  vectors  both 
lie  in  an  input  space. 

2 . 4 _ Multivariable  Root  Loci 


The  loci  of  open-loop  characteristic  gain  with  frequency  are  generalisations  of 
classical  Nyquist  diagrams  in  the  complex  gain  plane;  there  is  a  corresponding  general¬ 
isation  of  the  classical  Evans'  Root_  Locus  diagram  in  the  frequency  plane.  These  are 
the  characteristic  frequency  loci  which  are  traced  out  by  the  eigenvalues  of  S  ( g )  as  g 
traverses  the  negative  real  axis  in  the  gain  plane.  They  can  also  oe  regarded  as  the 
180°  phase  contours  of  g(s)  on  the  frequency  plane.  The  theory  behind  the  multivariable 
root  locus  has  been  given  by  Postlethwaite  and  MacFarlane  [3]  and  their  use  discussed  by 
Retallack  [81  ,  Kouvaritakis  and  Shaked  [91  ,  MacFarlane,  Kouvaritakis  and  Edmunds  [10], 
Kouvaritakis  and  Edmunds  [11]  ,  Kouvaritakis  [12]  ,  and  Owens  [13]  .  Many  examples  of 
generalised  Nyquist  and  Root-Locus  diagrams  are  given  in  [2]  ;  a  complete  review  of  the 
development  of  frequency- response  methods  is  given  in  [1]  . 


2 . 5 _ Conformal  Nature  of  Mapping  Between  Frequency  and  Gain 


The  algebraic  functions  g(s),  giving  characteristic  gain  as  a  function  of 
frequency,  and  s(g),  giving  characteristic  frequency  as  a  function  of  gain  contain 
essentially  the  same  information.  One  is  simply  a  "re-packaged"  version  of  the  other. 

One  would  thus  expect  there  to  be  an  exact  structural  relationship  between  the  (generalised) 
Nyquist  and  (generalised)  Root  Locus  diagrams  for  a  given  feedback  loop.  These  structural 
relationships  also  reflect  the  fact  that  the  mappings  between  s-plane  and  g-plane  and  vice- 
versa  are  conformal,  so  that  appropriate  angular  relationships  are  preserved.  In  cor¬ 
relating  the  main  features  of  the  two  forms  of  diagram,  the  following  rule  has  been  found 
useful . 


Locus-Crossing  Rule  :  Let 
under  g(s)  in  the  g-plane.  Also 
image  under  s(g)  in  the  s-plane. 


D  be  a  path  in  the  s-plane  and  let  D  be  its  image 
let  E  be  a  path  in  the  g-plane  and  let  E1  be  its 
Then  the  following  relationships  hold. 


(i)  Each  crossing  of  E 
and  vice  versa. 


by  D1 


corresponds  to  a  related  crossing  of 


D  by  E1  , 


(ii)  If  E1  crosses  D  from  left  to  right,  with  respect  to  a  given  orientation 
for  0,  then  E  will  cross  D1  from  left  to  right  with  respect  to  the  induced  orienta¬ 
tion  for  D1,  and  vice  versa.  (The  given  orientations  for  D  and  E  are  taken  to 
induce  appropriate  orientations  in  and  E1  :  that  is  a  traversal  of  D  by  s  in  a 
positive  orientation  makes  g(s)  traverse  D1  in  a  positive  orientation,  with  a  similar 
convention  for  E  and  E1.) 


( iii ) 

crosses  E. 


The  angle  at  which  E1  crosses 


is  the  same  as  the  angle  at  which  D1 


This  rule  is  illustrated  in  Fig. 7,  taking  D  as  the  positive  imaginary  axis  in 
the  s-plane  and  E  as  the  negative  real  axis  in  the  g-plane. 


E,  PORTION  OF 
ROOT  LOCUS 
D.  traced  from  u)=0  to  U)=oo 


E  traced  from 
g=oo  to  g=0 


Dl  PORTION  OF 
NYQUIST  LOCUS 


Figure  7 


This  shows  that  a  crossing  of  the  negative  real  axis  in  the  gain  plane  (at  g*  say)  by  a 
portion  of  the  generalised  Nyquist  diagram  corresponds  to  the  migration  of  a  closed-loop 
characteristic  frequency  from  left  to  right  (under  negative  feedback  of  amount  ( g ^ ) —  ■*- 
applied  equally  in  all  loops). 
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3.  PRINCIPAL  GAINS  AND  PRINCIPAL  PHASES 


A  natural  way  to  look  at  the  gain  of  an  operator  representing  a  multivariable 
system  transmittance  is  in  terms  of  the  ratio  of  the  norm  of  an  output  vector  to  the 
norm  of  a  corresponding  input  vector.  Thus  if 

y(s)  =  G { s)  u ( s )  (3.1) 


one  may  define  the  vector  gain  (or  simply  the  gain)  of  G(s)  for  input  u(s)  as 


gain  G|  u  = 


(3.2) 


where  |j.|| denotes  the  standard  Euclidean  vector  norm.  In  these  terms  one  can  see  that 
the  characteristic  gains  of  G(s) ,  despite  their  useful  role  in  closed-loop  stability 
analysis,  do  not  give  an  adequate  description  of  the  gain  behaviour  of  an  operator.  For 
example  the  matrix  transfer  function 


f  0  0 


G  ( s )  = 


3+1 

s+2 


0 


has  characteristic  gains  which  are  identically  zero  for  all  values  of  s,  yet  it  obviously 
does  not  have  zero  gain  for  all  non-zero  inputs.  For  this  reason  characteristic  de¬ 

compositions  of  an  operator  are  not  well  suited  to  the  discussion  of  the  performance  of 
feedback  systems,  and  another  form  of  operator  decomposition  is  needed  which  is  more 
suited  to  the  accurate  discussion  of  gain  behaviour.  Such  a  decomposition  is  found  in 
terms  of  the  singular  values  of  an  operator,  and  there  is  currently  widespread  interest 
in  investigating  the  role  of  singular  values  in  the  analysis  and  design  of  multivariable 
feedback  systems  [14], [15]  ,  [16]  ,  [17]  ,  and  [18]  . 

It  is  natural  to  view  the  discarding  of  phase  information  which  accompanies  a 
singular  value  decompostion  with  some  regret  and  thus  to  seek  some  way  of  retaining  a 
measure  of  phase  information.  This  can  be  done  by  making  use  of  the  polar  decomposition 
of  an  operator  using  one  half  of  the  polar  decomposition  to  provide  gain  information  (as 
in  the  singular  value  decomposition)  and  the  remaining  part  to  give  phase  information. 
Since  the  term  "singular  phase"  falls  strangely  on  the  ear,  and  since  the  term  gain 
is  more  immediately  relevant  to  physical  concepts  than  the  (admittedly  well-established) 
mathematical  term  singular,  one  can  adopt  the  terms  principal  gain  and  principal  phase 
in  this  characterization  of  operators  [17]  ,  [18]  . 

Any  square  complex  matrix  G  can  be  represented  in  the  forms 

G  =  UHr 

G  =  HlU 

where  U  is  unitary  and  H„,H  are  positive  semi -definite  Hermitian  matrices.  HR  and 
are  often  called  the  right  and  left  moduli  of  G  and  are  given  by 


where  G*  is  the  complex  conjugate  transpose  of  G.  U  is  then  uniquely  defined  by  (3.3) 

or  (3.4)  in  case  G  is  non-singular.  Using  these  polar  decompositions  one  can  make 
the  following  definitions: 

(i)  the  principal  gains  of  G  are  the  eigenvalues  of  the  Hermitian  part  of 
its  polar  decomposition:  and 

(li)  the  principal  phases  of  G  are  the  arguments  of  the  eigenvalues  of  the 
unitary  part  of  its  polar  decomposition  [18]  . 

Since  the  non-zero  eigenvalues  of  G*G  and  GG*  are  the  same  either  the  left  or 
right  Hermitian  factor  may  be  used  in  determining  the  principal  gains.  Furthermore  the 
principal  gains  are  the  same  as  the  singular  values  of  G.  These  principal  gains  and 
principal  phases  are  related  to  the  characteristic  gains  in  the  following  way  [18]  : 

(i)  The  magnitudes  of  the  characteristic  gains  of  a  complex  matrix  G  are 
bounded  above  and  below  by  the  corresponding  maximum  and  minimum  principal  gains. 

(ii)  If  the  principal  phases  of  a  complex  matrix  G  have  a  spread  of  less  than 
x  radians  then  the  arguments  of  the  characteristic  gains  of  G  are  bounded  above  and 
below  by  the  maximum  and  minimum  principal  phases. 

If  the  zeros  of  G(s)  are  defined  in  the  usual  way  in  terms  of  its  Smith-McMillan 
decomposition  [19]then  one  can  show  that  the  vector  gain  of  G(s)  vanishes  for  s  =  z  if 
and  only  if  z  is  a  zero  of  G(s)  [17]  . 


(3.3) 

(3.4) 


Principal  gain  decompositions  are  useful  in  analysing  the  performance  of 
feedback  systems.  For  example  Daniel  [201  has  used  them  to  show  how  in  a  regulator 
the  use  of  rank-deficient  feedback  can  lead  to  disturbance  amplification. 

Using  a  combination  of  principal  gains  and  principal  phases  one  can  develop  a 
Nyquist-type  sufficient  stability  criterion  which  can  be  used  to  characterize  the 
robustness  of  closed-loop  stability  when  the  system  model  is  subjected  to  a  linear 
perturbation  (of  either  multiplicative  or  additive  form)  at  any  point  in  the  feedback 
loop  0.8]  .  Such  an  approach  gives  results  which  are  much  less  conservative  than  those 
obtainable  using  the  Small  Gain  Theorem  (since  this  neglects  phase  aspects  of  the 
problem) . 

For  an  open-loop  mxm  gain  matrix  G(s)  it  is  clear  from,  properties  (i)  and  (ii) 
above  that  for  any  complex  frequency  s  =  jw  a  curvilinear  rectangle  can  be  drawn  from 
the  maximum  and  minimum  principal  values,  within  which  the  corresponding  m  values  of 
the  characteristic  gain  loci  must  lie.  Should  the  principal  phases  have  a  spread 
greater  than  n  then  the  characteristic  gains  lie  inside  an  annular  region  determined 
by  the  maximum  and  minimum  principal  gains.  If  these  rectangles  or  annular  regions 
are  constructed  for  values  of  s  around  the  Nyquist  D-contour  then  a  region  will  be 
outlined  in  the  gain  plane,  which  we  shall  call  the  principal  region,  within  which  the 
characteristic  gain  loci  must  lie  as  shown  in  Fig. 8.  Application  of  the  generalized 
Nyquist  stability  criterion  to  this  region  leads  to  the  stability  criterion  of  section  4 
below. 


3.1  Relationships  between  the  characteristic  gains  and  the  principal  gains  and 

phases  of  a  complex  matrix. 

In  this  sub-section  two  important  theorems  are  presented  which  relate  the 
characteristic  gains  to  the  principal  gains  and  phases  of  a  complex  matrix,  T.  The 
theorems  are  crucial  to  the  stability  criterion  of  the  next  section. 

Theorem  1  The  magnitudes  of  the  characteristic  gains  of  a  complex  matrix  T  are 
bounded  above  and  below  by  the  corresponding  maximum  and  minimum  principal  gains. 
Although  this  theorem  is  well  known  (see  for  example  1171  or  (211)  it  is  included  in 
the  main  text  because  of  its  importance  in  the  development  of  the  major  results  of  this 
paper  and  also  to  complement  theorem  2  which  is  unfamiliar . 
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Proof 

Let  T*Tmi  =  y^rai  i  =  1,2,... ,m  (3.7) 

where  T  is  a  complex  m*m  matrix,  the  (m.)  are  an  orthonormal  set  of  eigenvectors  for 
T*T,  and  the  are  the  principal  gains  of  T,  numbered  so  that  YjT  Ypt .  •  •  sYm- 

Also  let  Tv^  =  i  =  1,2,. .  .  ,m  (3.8) 

where  the  fv.)  are  a  set  of  eigenvectors  for  T  and  the  (X  )  are  the  corresponding 
eigenvalues.  1 


Any  "input"  vector  u  can  be  represented  in  the  form 

m 

u  =  l  c^i  ci  e  (C 


i=l 


and  therefore 

li Tu | I 


l  C  * 

u  |  I  2  u  u 


m  |  2 

I  lci' 

i  =  l 


l  u*T*Tu  “  f.Y2 

1  —  -  1=1  =  11 


(3.9) 


(3.10) 


where  f.  = 
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I  ci  ' 
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l  |c 

1=1 


,  f .  -0,  and  7  £  .  =  1  . 
2  1  i=l  1 
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That  is,  the  ratio 
implies  that 


! |Tu| ] 
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2  2 
—  is  a  convex  combination  [21]  ,  of  the  {y.^}  which 


i  Tu  | 


rl  - 


i  1  tl  |  |  2 


-  y_  for  all  u. 
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In  particular  for  u  =  v^  ,  | |Tu| =  |X^|  ,  and  therefore 

l  M  l2 

Yi  s  I  xi  I  »  7m 


(3.11) 


(3.12) 


Theorem  2.  If  the  principal  phases  of  a  complex  matrix  T  have  a  spread  of  less  than 
180  degrees  then  the  arguments  of  the  characteristic  gains  of  T  are  bounded  above  and 
below  by  the  maximum  and  minimum  principal  phases. 

This  theorem  is  a  special  case  of  a  more  general  result  by  All  R.Amir-Mogz  and 
Alfred  Horn  122  ,  theorem  4,  page  745). 

Proof 

Let  T  have  the  polar  decomposition 

T  =  UHr  (3.13a) 

let  Tv  =  Xv  (3.13b) 

where  v  is  an  eigenvector  of  T  and  X  a  corresponding  eigenvalue,  and  let 

Uz^  =  p^z^  i  =  l,2,...,m  (3.14) 

where  the  (z,)  are  an  orthonormal  set  of  eigenvectors  for  U  and  the  !  are  the  corres¬ 
ponding  eigenvalues,  the  arguments  of  which  are  the  principal  phases  of  t.  Then 

a-Hrv,  v>  =  < HpV ,  Tv  >  =  v*Hr*UHrv 


v  H 


AWi* 


i  =  1 


Hrv 


J1‘VHRV'  Zi"V 


Hrv> 


=  zt:\‘ 


(3.15) 


MO 


T  has  implicitly  been  assumed  nonsingular  and  consequently  so  has  H  which  is 
also  Hermitian  by  definition.  It  follows  therefore  that  • HRv,  v-  is  both  real  and 
greater  than  zero  which,  with  (3.15)  ,  implies  that  1  lies  in  the  convex  cone  generated 
by  the  ( }  .  If  the  cone  is  not  convex,  that  is  the  principal  phases  have  a  spread 
greater  than  180  degrees,  we  cannot  say  anything  about  the  argument  of  A  .  A  convex  cone 
1151  is  a  set  of  numbers  in  the  complex  plane  which  are  closed  under  linear  combinations 
with  nonnegative  coefficients.  I f  we  consider  drawing  an  arc  in  the  complex  plane, 
centre  the  origin,  in  an  anti-clockwise  direction,  then  the  angle  at  which  the  arc  enters 
the  convex  cone  is  referred  to  as  the  minimum  principal  phase  and  the  angle  at  which  the 
arc  leaves  the  cone  is  the  maximum  principal  phase.  We  have  therefore  shown  that  the 
arguments  of  the  characteristic  gains  of  T  are  bounded  above  and  below  by  the  corres¬ 
ponding  maximum  and  minimum  principal  phases. 

4.  A  SUFFICIENT  STABILITY  CRITERION 

In  the  generalised  Nyquist  stability  criterion  151,  141  and  1231  ,  the  stability 
of  a  linear  multivariable  feedback  configuration,  see  Fig. 9,  is  determined  from  the  char¬ 
acteristic  gain  loci  which  are  a  polar  plot  of  the  eigenvalues  of  the  open-loop  gain 
matrix  G(s),  evaluated  as  s  traverses  the  standard  Nyquist  D-contour.  This  criterion 
can  be  stated  as  follows  : 

the  closed-loop  system  is  stable  if  and  only  if  the  number  of  anti-clockwise 

encirclements  of  the  critical  point  ( - 1^  +  jO)  by  the  characteristic  gain  loci 

is  equal  to  the  number  of  open-loop  unstable  poles. 


Figure  9  Multivariable  feedback  configuration 


It  is  now  shown  how,  using  theorems  1  and  2,  a  sufficient  Nyquist-type  stability 
criterion  can  be  obtained  in  terms  of  the  principal  gains  and  phases  of  G(jw). 

For  an  open-loop  gain  matrix  G(s)  it  is  clear  from  theorems  1  and  2  that  for 
any  complex  frequency  s=jw  a  curvilinear  rectangle  can  be  drawn  from  the  maximum  and 
minimum  principal  values,  within  which  the  corresponding  m  values  of  the  characteristic 
gain  loci  (generalised  Nyquist  diagrams)  lie.  Should  the  principal  phases  not  form  a 
convex  cone  then  the  characteristic  gains  lie  inside  an  annular  region  determined  by  the 
maximum  and  minimum  principal  gains.  If  these  rectangles  or  annular  regions  are  con¬ 
structed  for  values  of  s  around  the  Nyquist  D-contour  then  a  region  will  be  outlined 
in  the  gain  plane,  which  we  shall  call  the  principal  region,  within  which  the  character¬ 
istic  gain  loci  must  lie;  see  Fig.  8.  Application  of  the  generalised  Nyquist  stability 
criterion  to  this  region  now  leads  to  the  following  stability  criterion: 

the  closed-loop  system  is  stable  if  m  times  the  number  of  anti-clockwise 
encirclements  of  the  critical  point  (-j^  +  jO)  by  the  principal  region  is  equal 
to  the  number  of  open-loop  unstable  poles. 

It  also  follows  that: 

the  closed-loop  system  is  unstable  if  m  times  the  number  of  anti-clockwise 
encirclements  of  the  critical  point  (-j~  +  jO)  by  the  principal  region  is  not 
equal  to  the  number  of  open-loop  unstable  poles. 

When  the  critical  point  lies  inside  the  principal  region  neither  stability  nor  instability 
can  be  inferred. 


Since  the  above  are  only  sufficient  conditions  for  stability  or  instability,  as 
straightforward  stability  tests  they  are  inferior  to  the  generalised  Nyquist  stability 
criterion.  However,  the  generalised  Nyquist  stability  criterion  only  determines 
stability  with  respect  to  a  single  gain  common  to  all  the  loops  and  as  such  does  not 
reliably  characterize  the  robustness  of  the  stability  property  when  the  system  is 
subjected  to  arbitrary  perturbations.  In  the  next  section  it  is  shown  how  a  new 
sufficient  Nyquist-type  stability  criterion  can  be  used  to  characterize  the  robustness 
of  a  linear  multivariable  design  when  it  is  subjected  to  linear  perturbations. 

5.  ROBUST  STABILITY 

One  of  the  major  reasons  for  using  feedback  control  as  opposed  to  open-loop  is 
the  presence  of  model  uncertainties  and  hence  the  desire  to  keep  the  system  stable  under 
large  parameter  variations.  However,  the  mere  presence  of  feedback  is  not  sufficient 
to  guarantee  the  robust  stability  of  a  system  [24  ]  .  Research  in  this  area  has 
recently  [25],  f26al  ,  and  [27  1  centred  around  finding  conditions  for  which  a  feedback 
system  will  remain  stable  when  the  open-loop  gain  matrix  G(s)  is  subjected  to  a  multi¬ 
plicative  or  additive  perturbation  AG ( s)  .  It  has  been  shown  that  the  feedback  con¬ 
figurations  of  figures  10  and  11  remain  stable  when  G(s)  is  stable  and 

|  |  AG { j u)  |  |  <  - - - - - r— 5 -  for  all  w  (5.1) 

I  l[l  +  G<  jw>  ]  |  | 

in  the  multiplicative  case,  and 

I  I  AG ( jw  II  <  - - - 1---  ■■ -  for  all  u  (5.2) 

I  I  [i  +  G(jo)J'1|  ) 

in  the  additive  case.  These  relationships  are  most  easily  derived  from  a  simple 
application  of  the  small  gain  theorem, [ 27 ) ,  [28]  and  [53]  ,  after  first  rearranging  the 
block  diagrams.  If  the  spectral  norm  ||.||_  is  taken,  then  the  robustness  of  each  con 
figuration  is  characterized  by  the  maximum  principal  gain  (singular  value)  of  the  appro¬ 
priate  frequency  response  matrix. 


Figure  10  Feedback  configuration  with  multiplicative  pertubation 


Figure  11  Feedback  configuration  with  additive  pertubation 

We  shall  now  examine  how  the  new  stability  criterion  of  section  4  might  be  used 
to  characterize  robustness.  Motivated  by  the  inclusion  of  phase  information,  the  aim 
is  to  obtain  less  conservative  bounds  on  a  linear  perturbation  than  those  furnished  by 
the  small  gain  theorem. 

5.1  Gain  and  phase  margins 

To  use  the  new  stability  criterion  in  the  context  of  robustness  we  firstly  re¬ 
arrange  figures  10  and  11  to  those  in  figures  12  and  13  respectively,  and  make  the 
observation  that  the  system  in  figure  10  is  stable  if  and  only  if  the  system  of  figure 
12  is  stable,  and  likewise  for  figures  11  and  13.  The  arguments  that  follow  are 


essentially  the  same  for  both  the  multiplicative  and  additive  case  and  so  for  simplicity 
we  will  consider  only  the  multiplicative  case. 


Figure  12  Rearranged  feedback  configuration  with 
multiplicative  pertubation 


Figure  13  Rearranged  feedback  configuration  with 
additive  pertubation 


We  now  construct  the  principal  region  for  fl  +  G(jw)  1  and  ask  the  question: 
Can  we  deduce  from  the  principal  gains  and  phases  of  ftGtjw)  a  deformed  principal  region 
within  which  the  eigenloci  (characteristic  gain  loci)  of  AG(jw)ri  +  G(jtjj)~l]-1  must  lie? 

If  this  is  so  then  we  will  be  able  to  specify  conditions  on  AG(jw)  for  which  the  perturbed 
system  will  remain  stable.  The  answer  is  yes,  and  the  conditions  on  AG(jw)  for  which  the 
system  remai  a  stable  are  summarised  in  the  following  theorems,  after  first  intro¬ 

ducing  some  notation. 


Let  I  I  +  G( ju)'1  1_1 


have  principal  gains  and  phases 


*  ^  ( w )  *■  a  2  (  <*j  )  • 


ft  A  ( oi )  $2  ( <4J )  ■' '  •  •  •  <  ( w ) 


respectively,  where  the  numbering  of  the  (0.(u)}  is  such  that  the  notions  of  maximum  and 
minimum  phases  are  as  defined  at  the  end  of  section  3;  it  is  assumed  that  the 
have  a  spread  of  less  than  71.  Similarly  let  the  principal  gains  and  phases  of 
AG(jus)  be  4A(uj)'-A2(u)-...r6m(w)  (5.5) 


_  A  ( ui )  *”  F. 2  ( uj )  ^  ■  ■  •  ' 


respecti ve ly ,  where  the  phases  {c.(w)}can  be  thought  of  as  negative  or  positive  arguments 
corresponding  to  unmodelled  phase  lag  or  advance  characteristics.  Also  let  the  condition 
numbers  [26b]  for  [I  +  G  ( juj)  P*  and  AG(joi),  using  the  ^-induced  norm,  be  c,  (ui)and  c7(u), 
respectively,  so  that  z  L 


A  fXni(a)J  a  6  (w) 

Cl(":)  =  (u>)  and  C2<u)  ' 


and  also  define 


if  (>■')  -  tan  ^ 
m 


|  c1 («) -1 1 c 2 (w> 
1-r c1 (W) -1 lc2 (w) 


which  will  be  referred  to  as  a  phase  modifier. 
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We  can  now  state  and  prove  the  following  theorems. 

Theorem  3  (Small  Gain  Theorem) :  The  closed-loop  system  remains  stable  under  a 

multiplicative  perturbation  AG(s)  if 

a)  AG(s)  is  stable  and 

b)  &  (w)  a _  ( ai)  <  1  for  all  tn  . 

m  m 

This  is  exactly  the  small  gain  theorem  introduced  by  Sandberg  127]  and  Zames  128]  for  a 
more  general  class  of  systems.  The  result  is  often  conservative,  especially  in  our 
case  where  we  are  considering  linear  systems,  but  the  conservativeness  can  be  reduced  if 
we  consider  the  principal  phases  as  described  in  the  next  theorem. 

Theorem  4  (Small  Phase  Theorem) :  The  closed-loop  system  remains  stable  under  a 

multiplicative  perturbation  AG(s)  if 

a)  AG(s)  is  stable, 

b)  (e^dj)  +  i,  j  =  l,...,m)  have  a  spread  of  less  than  it  for  all  w, 

c)  Ic1(ui)  -  1  )c2  (u)  <  1  for  all  o, 

d)  E^(w)  +  Oj(u>)  -  4'm!w)>  -  ti  for  alio,  and 

e)  t  (w)  ♦  8  (o)  +  4/„(u))t  tt  for  all  w  . 

m  m  m 

Note  that  conditions  d)  and  e)  are  analogous  to  the  phase  margin  concept  used  in  the 
analysis  of  single-input-single-output  systems  modulo  the  phase  modification  gj  ( u< )  . 

In  general,  Tneorem  4  is  more  applicable  at  low  frequencies  when  9,(w)  is  a  small 
lag.  At  higher  frequencies,  when  p^(ou)  approaches  -  it  and  the  principal  region  is  in 
the  vicinity  of  the  “-1"  point,  then  Theorem  3  is  more  useful.  Consequently,  for  a  more 
practical  characterization  of  the  robustness  of  the  stability  property  over  the  whole 
frequency  range,  Theorems  3  and  4  are  combined,  as  in  Theorem  5  below. 

Theorem  5:  The  closed- loop  system  remains  stable  under  a  multiplicative  perturba¬ 
tion  AG  ( s )  if,  for  some  frequency  ujfa  , 

a)  the  conditions  of  Theorem  4  are  satisfied  in  the  frequency  range  [0,u^j  and 

b)  the  conditions  of  Theorem  3  are  satisfied  in  the  frequency  range  (^,“>1. 

The  proofs  of  Theorems  3  and  4  are  given  in  1181;  essentially  they  are  based  on 
the  following  ideas.  In  Theorem  3  we  restrict  the  moduli  of  the  characteristic  gain 
loci  (generalised  Nyquist  diagrams)  to  values  less  than  1,  so  we  can  have  no  encircle¬ 
ments  of  the  critical  point  -  1,  and  hence  closed-loop  stability  regardless  of  the  phases 
involved.  In  Theorem  4  we  prevent  the  characteristic  gain  loci  from  crossing  the  negative 
real  axis  by  restricting  their  arguments,  so  again  we  can  have  no  encirclements  of  the 
critical  point,  and  hence  closed-loop  stability  regardless  of  the  associated  moduli. 

In  Theorem  5  we  prevent  the  characteristic  gain  loci  from  encircling  the  critical 
point  by  restricting  their  arguments  in  the  frequency  range  IO,u)b-  and  restricting  their 
moduli  in  the  frequency  range  J.  The  proofs  of  Theorems  ?  and  4  therefore  show 

that  Theorem  5  holds,  and  hence  no  formal  proof  of  Theorem  5  is  given. 

Interpretation  of  Theorems  3,  4  and  5:  Theorem  3  tells  us  that  stability  is 

maintained  for  a  stable  linear  perturbation  AG(s)  if  the  spectral  norm  of  the  perturba¬ 
tion  frequency  response  is  always  less  than  a  value  determined  from  the  system  model. 
Therefore,  in  the  polar  decomposition  for  AG(jui)  there  is  an  upper  limit  on  the  maximum 
principal  gain  at  each  frequency,  but  no  restriction  at  all  on  the  principal  phases. 

Theorem  4  shows  us  how  extra  gain  can  be  accommodated  in  the  perturbation 
providing  certain  phase  requirements  are  satisfied.  Conditions  b)  and  c)  are  mathematic¬ 
al  requirements  for  the  definition  of  the  phase  modifier  i|rm(w)  and  the  development  of 
the  phase  conditions  d)  and  e) ;  see  1181  .  More  precisely,  condition  b)  implies  that 
at  any  frequency,  the  two  sets  of  principal  phases,  one  for  AG(jw)  and  the  other  for 
I  I  +  G  ( j  w)  ,  each  lie  in  a  convex  cone,  that  is,  have  a  spread  less  than  x  ,  and  also 

that  the  "sum"  of  these  convex  cones  is  convex.  The  sum  of  two  cones  is  here  defined 
as  the  cone  formed  from  the  products  of  all  complex  numbers  in  one  cone  with  thos  in 
the  other.  The  term  "sum"  is  used  because  the  sum  of  two  convex  cones  is  clearly 
defined  by  summing  the  principal  phases  of  one  cone  with  those  of  the  other.  Condition  c) 
restricts  the  condition  numbers  of  c^(w)  and  c^  ( w)  ;  c^  (w)  is  required  to  be  less  than  2 

and  c2(w)  less  than  lMc,  (u|  -  1)  .  In  practice,  for  a  design  which  aims  at  accurate 
tracking  through  approximate  diagonalization,  c,  (w)  will  be  about  1,  allowing  a  large 
value  for  C2(w)»  the  condition  number  of  the  perturbation.  Note,  however,  that  a 
large  value  of  C2(w)  corresponds  to  a  large  value  of  Tt/  ( ui )  which  from  condition  d) 
imposes  a  greater  restriction  on  e  (id)  as  a  phase  lag  In  the  perturbation.  As  c2(w)  is 
reduced,  the  phase  lag  in  the  perturbation  can  be  increased,  that  is,  condition  number 
can  be  traded  for  phase  and  vice  versa.  It  should  be  noted  that  c2(w)<  °°  •implies  that 
AG (  j'-j)  is  nonsingular. 
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5.2  Automatic  Scaling 


Mees  and  Edmunds  [29]  have  recently  developed  and  implemented  on  the  Cambridge 
multivariable  design  package  an  algorithm  which  effectively  finds  a  real  diagonal  matrix 
S  such  that  S”^T(jw)S  is  as  near  diagonal  as  possible  over  the  whole  frequency  range. 
This  automatic  scaling  technique  can  be  used  in  the  study  of  stability  with  respect  to  a 
perturbation  ^G(s) ,  as  is  now  explained  for  the  "multiplicative"  case. 


Instead  of  constructing  the  principal  region  for  11+  Gtjw)-1]-  ,  let  us  consider 
the  principal  region  for  S~1[I  +  G(juj)  — 1  J— 's.  This  principal  region  will,  in  general, 
have  a  narrower  band,  and  since  Fig  12  can  be  redrawn  as  shown  in  Fig. 14  we  see  that  it 
predicts  stability  with  respect  to  the  perturbation  S_1AG(s)S.  The  scaling  matrix  S 
therefore  tells  us  which  elements  of  AG(s)  are,  relatively  speaking,  the  most  sensitive. 


Figure  14  Feedback  configuration  with  scaling 


5.3  Additive  Perturbations 

As  stated  earlier,  one  can  just  as  easily  consider  the  "additive"  case,  the  only 
essential  difference  being  that  the  principal  region  for  [I  +  G(jw)]-*-,  assuming  integral 
action,  is  in  the  neighbourhood  of  the  origin  at  low  frequencies  and  in  the  neighbour¬ 
hood  of  one  at  high  frequencies.  In  the  additive  case,  therefore,  the  small  gain  theo¬ 
rem  is  more  applicable  at  low  frequencies  and  the  small  phase  theorem  is  more  applicable 
at  high  frequencies,  in  contrast  to  the  multiplicative  case.  In  general,  a  linear  per¬ 
turbation,  multiplicative  or  additive,  can  be  considered  at  any  point  in  a  feedback 
system  by  constructing  the  appropriate  principal  region. 

6.  ZEROS 

The  characteristic  gain  function  g(s)  has  a  set  of  poles  and  zeros,  and  its 
related  generalised  Nyquist  and  Root-Locus  diagrams  have  distinctive  patterns  of  asymptot¬ 
ic  behaviour.  It  is  of  great  value  to  relate  all  these  to  the  state-space  model  para¬ 
meters  of  the  feedback  loop  dynamics.  The  poles  are  simply  a  subset  of  the  eigenvalues 
of  an  appropriate  A-matrix  if  the  state-space  model  involved  is  not  fully  controllable! 
normally  they  will  be  the  full  set  of  A-matrix  eigenvalues.  This  section  looks  at  the 
relationship  between  the  zeros  of  g(s)  and  the  state-space  structure,  and  Section  8  looks 
at  asymptote  behaviour. 

In  physical  terms  zeros  are  associated  with  the  vanishing  of  vector  gain;  that 
is  with  the  existence  of  non-zero  input  exponential  signal  vectors  which  result  in  zero 
output  1301.  The  discussion  of  zeros  in  the  literature  is  confusing  because  one  can 
talk  about  zeros  in  connection  with  various  objects  :  £(A,B,C,D),  or  G(s),  or  g(s)  for 
example.  The  object  J.'(A,B,C,D)  has  associated  with  it  a  larger  set  of  zeros  than  the 
object  G  ( s )  because  in  passing  from  the  representation  j:(A,B,C,D)  to  the  representation 
G(s)  one  discards  phenomena  (uncontrollability  and  unobservability)  which  can  be  discussed 
in  terms  of  zeros  (the  so-called  decoupling  zeros  [31]).  In  turn  the  object  G(s)  has  a 
larger  set  of  zeros  than  the  object  g(s)  because  vanishing  of  all  characteristic  gains 
does  not  necessarily  imply  the  vanishing  of  the  vector  gain  for  G(s).  For  the  purposes 
of  this  summary  we  will  take  the  system  representations  being  discussed  to  be  completely 
controllable  and  completely  observable,  and  the  feedback  loop  transmission  zeros  to  be 
those  values  of  s  obtained  by  considering  the  consequences  of  putting  g  =  0  in  a 
specific  way.  A  useful  physical  interpretation  of  a  zero  then  comes  from  the  following 
result  I  19  ]  . 

Transmission-blocking  theorem:  For  a  system  £(A,B,C,D)  having  a  number  of  in¬ 

puts  less  than  or  equal  to  the  number  of  outputs,  necessary  and  sufficient  conditions  for 
an  input  of  the  form 

u(t)  =  l<t)eztuz  (6.1) 

to  result  in  a  state-space  trajectory 
=  1  (t)eztxz  for  t  > 0 


x  ( t ) 


(6.2) 
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and  an  output 


y  (t)  n-  o  for  t  >  0  (6.3) 

are  that  a  complex  frequency  z  and  a  complex  vector  uz  exist  such  that 


(Here  1  ( t )  =  0  for  t  s  0  and  1  ( t )  =  1  for  t>0.) 

Hence  the  zeros  defined  in  this  way  are  the  roots  of  the  invariant  factors  of  an 
appropriate  pencil.  Tne  corresponding  vectors  uz  and  x,  are  called  the  zero 
directions  in  the  input  and  state  spaces  respectively  [19j 

Zero  Pencil:  From  (6.4)  we  have  that,  for  the  case  when  D  =  0, 


(zl  -  A)  x 


Cx_ 


Bu 


=  0 


Zero  directions  in  the  state  space  naturally  lie  in  the  kernel  of  C  so  that 
x  =  Mv 


(6.5) 

(6.6) 

(6.7) 


where  M  is  a  basis  matrix  representation  of  Ker  C  and  v  is  an  appropriate  vector  of 
constants.  Substitution  of  (6.7)  into  (6.5)  and  premultiplication  by  the  full-rank 
transformation  matrix  N .  "1  where  N  is  a  full-rank  left  annihilator  of  B(i.e.NB=0)  and 

LB+J  of 


a  left  inverse 


(z  NM-NAM)VZ  =  0 
uz  =  B ' ( zI-A) Mvz 


Df  B  (i.e.  B'B=X.  yields 


(6.8) 

(6.9) 


The  object  ( sNM-NAM)  is  called  the  zero  pencil  132)  ,  1331  for  the  system  1(A,B,C). 

The  zeros  are  the  roots  of  the  invariant  factors  of  the  zero  pencil,  and  this  fact  can  be 
used  to  exhibit  the  geometric  characterization  of  zeros  in  a  very  succinct  way.  To  do 
this  it  is  useful  to  use  Sastry  and  Desoer's  notion  of  restricting  an  operator  in  domain 
and  range  134]  . 

6.1  Restriction  of  an  operator  in  domain  and  range 

Given  a  linear  map  A  from  Cn  to  Cn  and  two  subspaces  l.  L  of  C.  n  ,  where 
cn  is  the  space  of  n-dimensional  complex  vectors,  then  the  restriction/of  A  to  in 

the  domain  and  S2  in  the  ran9e  is  defined  to  be  the  linear  map  which  associates  with 
X1  '*1  the  orthogonal  projection  of  Ax^  on  to  jfy  •  This  restricted  linear  map  (and 
also  its  matrix  representation)  will  be  denoted  By 

A  \  j|r*  J2  =A  (6.10) 

If  the  columns  of  Sj  and  S2  respectively  form  orthonormal  bases  for  and  ,/2  then  the 
matrix  representation  of  the  restricted  operator  is  given  by 


A  S2^  S1 

where  *  denotes  complex  conjugate  transposition, 
then  calls  the  roots  of 


det 


[S2  Sl>-  S2  A  Sl]  - 


(6.11) 

With  a  mild  abuse  of  language  one 

(6.12) 


the  spectrum  of  the  restricted  operator  A- 

6.2  Spectral  characterization  of  zeros 

* 

t  Suppose  we  choose  N  and  M  so  that  their  columns  are  orthonormal  bases  for 
Ker  B  and  Ker  C  respectively.  Then 


and 


NM  = 


NAM  =  A 


Xnl 

(Ker 

l^Ker 


C-Ker  B 


C-Ker  B 


(6.13) 


(6.14) 


and  one  may  say  that  : 

the  finite  zeros  of  £(A,B,C)  are  the  spectrum  of  the  restriction  of  A  to  Ker  C 
in  domain  and  Ker  B*  in  range. 
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6.3  Interpretation  of  zeros  and  zero  directions  in  terms  of  intersecting  manifolds. 

Let  J(s)  ^  (sI-A)-1B  (6.15) 

and  for  a  given  value  of  complex  frequency  s  consider  the  subspace  ImJ(s)c  JC  .  If  B  is 
of  full  rank  then  any  vector  xs<:  ImJ(s)  can  be  made  a  closed-loop  eigenvector  of 
E(A,B,C)  under  state-feedback;  also  if  xs  is  a  closed-loop  eigenvector  of  E(A,B,C) 
under  state- feedback  it  must  lie  in  ImJ(s) .  Now  it  is  a  known  property  of  closed-loop 
characteristic  frequencies  that,  under  feedback  of  unbounded  norm,  an  appropriate  number 
of  closed-loop  characteristic  frequencies  migrate  to  the  finite  zeros  of  the  loop  trans¬ 
mission  matrix  (35)  .  Since  for  arbitrarily  high  feedback  gain,  the  relevant  closed- 
loop  poles  will  coincide  with  the  zeros,  an  appropriate  number  of  closed- loop  modes  will 
have  become  unobservable  and  their  corresponding  closed-loop  eigenvectors  will  thus  lie 
in  Ker  C.  Hence  one  would  expect  that  the  zero  direction  for  a  finite  zero  z  would  be 
associated  with  the  intersection  of  ImJ(z)  with  Ker  C.  (Here  Im  denotes  image.) 

6.4  Finite  zeros  when  D  is  non-zero. 

Suppose  the  direct-coupling  operator  D  for  E(A,B,C,D)  having  m  inputs  and 
outputs  has  nullity  dQ.  Then  £(A,B,C,D)  has  at  most  (n-dQ)  finite  zeros  and  at  least  u 
root- locus  asymptotes  going  to  infinity  (to  infinite  zeros)  [101  .  Let  D  have  a 
characteristic  decomposition 

D  =  [U  M  ]  Jo  0  Vo  (6.16) 

o  °  ^ 


where  t  UQ  MQ )  and  ( Ng]*’  are  the  eigenvector  and  dual  eigenvector  matrices  for  D  and 
J  is  the  Jordan  block  associated  with  the  non-zero  eigenvalues  of  D.  Further  suppose 
that  there  is  a  complete  set  of  eigenvectors  spanning  the  null  space  of  D.  Define 
A0,bq,C0  by 


A  =  A  -  BU  vJ  V  C 
o  o  o  o 


Bc  =  BM. 


Then  the  finite  zeros  and  root  locus  asymptotes  of  E(A,B,C,D)  are  those  of  £ (Aq ,Bq ,CQ) . 

In  case  D  has  full  rank  one  has  that  there  are  n  finite  zeros  of  E(A,B,C,D) 

which  are  given  by  o(A-BD~(-C),  the  spectrum  of  the  matrix  (A-BD'^C)  . 

7.  BILINEAR  TRANSFORMATION  OF  FREQUENCY  AND  GAIN  VARIABLES 

If  £ (A ,B ,C ,D)  having  m  inputs  and  m  outputs  has  a  D  which  is  non-singular 

then,  as  noted  in  Section  6.4,  it  will  have  n  finite  zeros  given  by  o[A-BD_1C]  . 

Given  a  bilinear  transformation 

p  =  g-s.t_b  (7.1) 

F  c  s  +  d 

s  =  ~.jP  (7.2) 

cp  -  a 

on  the  complex  frequency  it  has  been  shown  b^  Edmunds  [36]  that  it  is  possible  to  find 
a  correspondingly  transformed  system  £(A,B,C,D)  such  that 

G(p)  =  G  ( s )  (7.3) 

Thus  if  the  original  system  has  a  zero  at  z  the  transformed  system  will  have 
a  zero  at  a  location  given  by  substituting  z  in  (7.1).  One  can  therefore  choose  an 
appropriate  ..bilinear  transformation  to  get  a  D  of  full  rank,  calculate  the  zeros  as 

a [ A  -B  6'*  C)  and  transform  back  to  find  the  complete  set  of  zeros  (finite  and 

infinite)  of  the  original  system.  The  bilinear  mapping  is  a  conformal  map  between  the 
s-plane  and  the  p-plane  for  all  points  except  p  =  a/c  which  gets  sent  to  infinity  on 
the  s-plane  and  s  =  -d/c  which  gets  sent  to  infinity  on  the  p-plane.  It  is  assumed 
that  (ab  -  bc)  /  0  and  that  the  original  system  has  no  pole  at  -  d/c.  The  set  of  trans¬ 
formed  system  matrices  is  given  by 

A  =  (cA  +  dl)-1  (aA  +  bl)  (7.4) 

B  =  (cA  +  dI)_1B  (7.5) 

C  =  (ad  -  bc) C (cA  +  dl)-1  (7.6) 


D  =  D  -  cC(cA  +  dl)  B 


g 


A  similar  bilinear  transformation  can  be  carried  out  on  the  complex  gain  variable 


a  3 
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q 


a  g  +  b 
c  g  +  d 


9  = 


c  q  -  a 

The  correspondingly  transformed  system 


£ (a1b}c1d}) 


is  such  that 


S  (q)  =  S (g) 


and  has  state-space  parameters  given  by 
A1  =  A  -  cB (CD  +  dl) -1C 
B1  =  (ab  -  be) B ( cD  +dl)_1 
C1  =  (ac  D  +  dl) -1C 
D1  =  (cD  +  dl ) - 1 ( aA  +bl) 

8.  GEOMETRIC  THEORY  OF  ROOT  LOCUS  AND  NYQUIST  DIAGRAMS 


(7.8) 

(7.9) 

(7.10) 


( 7 . 1  i  i 

(7.12) 

(7.13) 

(7.14) 


It  has  been  noted  that  the  finite  zeros  of  a  vector  feedback  loop  transmittance 
can  be  characterized  in  terms  of  the  spectrum  of  the  restriction  of  its  A-matrix  in 
domain  and  range.  The  idea  of  using  the  spectrum  of  a  restricted  operator  in  connection 
with  the  Markov  Parameters  of  a  loop  transfer  function  matrix  leads  to  a  geometric  treat¬ 
ment  of  the  high-gain  asymptotic  behaviour  of  generalised  Root-Locus  diagrams  and  the 
high-frequency  asymptotic  behaviour  of  generalised  Nyquist  diagrams. 


Let  the  transfer  function  matrix  for  a  strictly  proper  (i.e  D  =  0)  feedback  loop 
be  expanded  in  a  Taylor  series  about  s  =  ®  as 


where  {G^  :  i=l,2,....}  are  the  Markov  Parameters 

Gj^  =  CB  Gk  =  CAk_1B  k  =  2,3 .  (8.2) 

The  use  of  Markov  Parameters  to  relate  the  main  features  of  generalised  Root- 
Locus  diagrams  to  state-space  model  parameters  has  been  discussed  by  a  number  of  in¬ 
vestigators  [37  1  ,  [38]  ,  [391,  [40]  [34]. The  approach  and  notation  used  here  follows 
Sastry  and  Desoer  [ 34  ]  ,  who  give  a  good  discussion  of  the  assumptions  involved  in  a 
detailed  analysis  of  this  sort. 


Let  a  sequence  of  restricted  operators  be  defined  as  follows  : 


Gk  =  G^Ker  G^Ker 


(8.3) 


and  take  Y  =  G.  . 

G1  1 

v 

Let  d^  be  the  nullity  (rank  defect)  of  G^  . 

Then  the  high-gain  asymptotic  behaviour  of  the  various  branches  of  the  generalised  root- 
locus  diagram  can  be  determined  as  follows. 


1st  order  branches:  s^^  ^ 

where  1,  !  \  { 0 } 

1,1  G , 


i=l ,2, 


..(m-d^ 


(8.4) 


and  g  -O 

v 

The  collection  of  symbols  eo[G  j\{o)is  to  be  read  as  “belongs  to  the  non-zero  spectrum 
v  1 

of  G^  " 

2  1 

2nd  order  branches:  s“  ^  ~  ~  1,2  i=l,2, . .^dj-d^) 


where  X ^  2  E  ^  gJ  '  ^ 0 ^ 

and  g  O 

and  for  the  kth  order  branches 

slk;-^  1  *  1'2'-*'(dk  -  dk-l> 
Ai,kf-°rc^  \<o) 
and  g-»0 


(8.5) 


where 


(8.6) 


MS 


On  invoking  the  correspondence  between  the  high-frequency  asymptotes 
for  the  generalised  Nyquist  diagrams  and  the  high-gain  asymptotes  for  the  generalised 
Root-Locus  diagrams  one  then  gets  the  following  description  of  the  high-frequency  branches 
of  the  generalised  Nyquist  diagrams. 


1st  order 

branches : 

gi,l 

.  i ,  1 

jm 

(8.7) 

2nd  order 

branches : 

gi ,  2 

Xi,2 

<jw)2 

(8.8) 

k*"*1  order 

branches : 

gi,k 

-  Xi,k 
(jw)k 

(8.9) 

where 

Xi,kEoCS 

I\{0) 

as  before. 

and 

For  example  if  the  first  Markov  Parameter  CB  has  full  rank  m  then  it  can 
be  shown  that  the  state  space  is  the  direct  sum  of  the  image  of  the  input  map  B  and  the 
kernel  of  the  output  map  C: 

=  1m  B  ®  Ker  C  (8.10) 

and  that  the  zero  pencil  then  becomes 

(si  _  -  NAM) 
n-m 

There  are  then  (n  -  m)  finite  zeros,  given  by  a (NAM) ,  and  there  are  m  first-order 
asymptotic  branches  of  the  generalised  Root-Locus  and  Nyquist  diagrams  given  by  : 


and 


si(1  -  -a-^CBJ/g  g-0 


gi,l 


0A  (CB) 

jw 


1  =  1,2,  ....,m 


(8.11) 


where  (o^tCB)  :  i  =  1,2 . m}  are  the  eigenvalues  of  the  first  Markov  parameter  CB. 


The  numbers  of  the  various  types  of  asymptotic  behaviour  are  summarized  in 
the  following  table  (where  v is  the  first  integer  for  which  the  nullity  d^  vanishes) : 


Order 

Number  of 

Nyquist  Asymptotes 

Number  of  Root 
Locus  Asymptotes 

1 

m  - 

m  -  d^ 

2 

dl  -  d2 

2(d1-  d2) 

3 

d2  -  d3 

3(d2-  d3) 

V 

dv-l 

vdv-l 

Total 

Number  of 
Asymptotes 

m 

_ 

v-1 

m  +  £  d . 
i=l  1 

Adding  up  the  total  number  of  root-locus  asymptotes  and  subtracting  the 
result  from  the  dynamical  order  of  the  feedback  loop  we  find  that 

v-1 

Number  of  finite  zeros  =  n  -  m  -  £  d,  (8.12) 

i=l  1 

A  careful  study  of  the  implications  of  the  above  relationships  gives  a  vivid 
geometrical  insight  into  the  role  played  by  the  basic  state-space  model  operators  A,B, 
and  C  in  generating  the  main  structural  features  of  the  asymptotic  behaviour  of  general¬ 
ised  Root-Locus  and  Nyquist  diagrams. 
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9.  ANGLES  OF  ARRIVAL  AT  ZEROS  AND  ANGLES  OF  DEPARTURE  FROM  POLES. 

The  ways  in  which  branches  of  the  root  locus  depart  from  poles  and  arrive  at 
zeros,  together  with  the  relationships  of  other  important  structural  features  of  these 
diagrams  to  the  basic  state-space  model  parameters  has  been  considered  by  a  number  of 
investigators  [91  ,  [111  ,  [411  ,  [42]  .  In  this  section  we  follow  the  approach  of 
Thompson,  Stein  and  Laub  [42]  .  For  the  output- feedback  system 

4  =  Ax  +  Su 

y  =  Cx 

U  =  —  |  Ky  (9.1) 

the  closed- loop  frequency  matrix  will  be 

SK(g)  =  A  -  |  BKC  (9.2) 

Let  s, ,x.  and  y*  be  closed-loop  characteristic  frequencies  with  their  associated  right 
and  left  eigenvectors  so  that  for  0  < g  < " 


[  S„  (g)  -  s,  I  lx,  =0 

K  1  1  i  =  1,2 

y*[SK(g)  -  Sii]  =  o 


,n 


(9.3) 


then  s.,  x.,  and  y*  can  be  obtained  by  solving  the  generalised  eigenvalue  problems 
specified  by 


~  A  -  SiI  B  “i 

'xi ' 

-1  l 

-  C  -gK 

-  wi  - 

(Yi  IV 


A  -  B 


-gK  J 


O 

i  =  1, 2  - - ,p 

0 


where  role  of  the  integer  p  is  discussed  below, 
has  been  shown  by  Thompson  et  al  !  42  that  : 


(9.4) 


(9.5) 


Then, in  terms  of  these  quantities  it 


The  angles  of  the  root  locus  for  0- g*.“>  and  for  distinct  frequencies  s,  are 

given  by 


or 


arg (ds. ) 


arg(ds^) 


arg 


-  y j^BKC  xx 


0  <  g  r.  ® 

l  =  1,2, 


arg 


0  i  g<«> 
i  =  1,2, 


,P 


-P- 


(9.6) 


(9.7) 


The  angles  of  departure  from  poles  are  found  using  (9.6)  with  g  =  «°  and  the  angles  of 

arrival  at  zeros  are  found  using  (9.7)  with  g  =  0.  Note  that  for  g  >0  we  will  have 

p  =  n  and  for  g  =  0  we  will  have  0<p<n  -  m,  where  p  is  the  number  of  finite  zeros 

and  m  is  the  number  of  inputs  and  outputs  of  the  vector  feedback  loop. 

10.  PROPERTIES  OF  NYQUIST  AND  ROOT  LOCUS  DIAGRAMS  FOR  OPTIMAL  FEEDBACK  SYSTEMS. 

Several  investigators  have  studied  the  asymptotic  Nyquist  diagram  behaviour 
(for  high  frequencies),  the  asymptotic  Root-Locus  diagram  behaviour  (as  explained  below) 
and  the  location  of  the  finite  zeros  for  the  standard  optimal  state  feedback  control 
problem  involved  in  minimizing  the  cost  function 

ftn  j 

J  =  (xtQxl+  putRu)  dt  (10.1) 

*  o 

where  R  is  positive  definite  symmetric,  Q  is  positive  semi-definite  symmetric  and  p 
a  finite  positive  real  constant  [12j,[42|  ,[  43  1 ,  [  44  ] ,  [  45  ] ,  [461,  [47!. 


It  is  known  as  a  result  of  these  studies  that  : 


(i)  All  the  finite  zeros  of  the  optimal  feedback  loop  lie  in  the  left-half 
complex  plane. 

(ii)  All  the  generalised  Nyquist  diagrams  for  the  optimal  feedback  loop  have 
infinite  gain  margin  and  at  least  60°  phase  margin. 


mo 


(iii)  When  p in  the  performance  index  (10.1)  is  “  the  n  branches  of  the 
optimal  root  locus  (i.e.  the  locus  of  closed- loop  characteristic  frequencies  traced  out 
with  variation  of  o  )  start  on  a  set  of  poles  which  are  the  stable  poles  of  the  original 
plant  together  with  the  mirror  images  in  the  imaginary  axis  of  the  unstable  poles  of  the 
original  plant  (assumed  to  have  no  purely  imaginary  poles) .  As  p varies  all  the  branches 
of  the  optimal  root  locus  remain  in  the  left-half  of  the  frequency  plane.  For p tending 
to  zero  a  number  p  of  the  branches  (where  CHp- n-m)  stay  finite  and  approach  a  set  of 
loop  transmission  zeros.  The  remaining  (m-p)  branches  approach  infinity  in  a  set  of  so- 
called  Butterworth  patterns.  A  k  order  Butterworth  pattern  has  k  asymptotes  each  of 
which  radiates  from  the  origin  through  the  left-half  plane  solutions  of 
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k  +  1 


(10.2) 


11.  DESIGN  TECHNIQUES. 


Despite  the  great  efforts  which  have  been  expended  on  the  problems  involved  over 
the  past  decade,  much  remains  to  be  done  in  formulating  a  definitive  design  technique  for 
linear  multivariable  feedback  systems  of  high  (say  having  up  to  100  state  variables) 
dynamic  order.  A  fairly  comprehensive  attempt  to  extend  the  philosophy  and  techniques 
of  the  classical  Nyquist-Bode-Evans  design  approach  to  the  multivariable  case  is  given  in 
1 2 ]  ,  and  this  line  of  attack  can  be  compared  with  a  variety  of  other  approaches  to  a 
common  design  problem  in  [48]  .  A  fairly  complete  rpview  of  the  development  of  the 
classical  frequency-response  methods  can  be  found  in  [ 1 ]  ,  together  with  a  discussion  of 
various  forms  of  extension  to  the  multivariable  case.  A  computer-aided  interactive- 
design  package  [49]  has  been  developed  for  use  in  the  implementation  of  the  techniques 
described  in  [21  .  Experience  of  its  use  has  shown  that  an  interactive  design  method  of 
this  sort  can  be  useful  to  industrial  designers.  There  is  thus  a  considerable  incentive 
to  develop  further  those  forms  of  design  technique  which  are  based  on  a  combination  of 
the  attractive  graphical  features  of  complex-variable  methods  and  the  geometrical  way  in 
which  the  main  structural  features  of  the  algebraic  curves  used  are  related  to  state-space 
model  parameters. 

11.1  Design  via  generalised  Nyquist  and  generalised  Root  Locus  approach. 

The  configuration  considered  is  shown  in  Fig. 15.  A  given  plant  has  i  inputs  and 
m  outputs,  and  it  is  assumed  that  m  >  l.  The  plant  output  vector  y  consists  of  two 
subvectors  c  and  z: 


V  = 


where  c  is  an  i-dimensional  vector  of  variables  whose  behaviour  is  to  be  controlled  and 
z  is  an  (m-<)-dimensional  vector  of  extra  available  measurements.  The  design  process  is 
carried  out  in  two  main  stages,  corresponding  to  the  arrangement  shown  in  Fig. 16. 


Stage  1  -  Compensator  Design:  The  purpose  of  the  compensator  is  to  make  use  of 

the  extra  information  contained  in  the  vector  z  to  create  a  new  ^-dimensional  input 
vector  v  which  gives  a  suitable  form  of  transmittance  from  v  to  the  set  of  controlled 
outputs  c.  A  suitable  form  of  this  transmittance  is  taken  to  be  one  which  allows 
acceptably  large  amounts  of  feedback  gain  to  be  applied  around  this  modified  plant. 


Stage  2  -  Gain  Injector  Design:  Having  used  all  of  the  available  measurements 

to  the  best  advantage  in  arranging  for  a  suitable  transmission  characteristic  between  v 
and  c,  the  design  is  completed  by  choosing  a  gain  injector  block  which  will  inject  the 
suitably  high  feedback  gains  required  to  meet  the  closed-loop  performance  specification 
over  the  operating  bandwidth.  The  gain  injector  block  will  normally  contain  both 
integral  action  terms  to  remove  low-frequency  errors  and  phase-compensating  terms  to 
improve  closed-loop  damping.  Early  treatments  of  this  part  of  the  approach  are  often 
called  the  characteristic  locus  method  because  it  is  essentially  based  on  the  manipulation 
of  characteristic  gain  loci  (generalised  Nyquist  diagrams) . 

The  basic  idea  behind  the  compensator  design  is  best  seen  from  a  consideration 
of  the  arrangement  shown  in  Fig. 17 (a)  where  the  gain-injector  block  has  been  replaced  by 
a  unit  transmittance  operator.  In  this  arrangement  the  compensator  is  designed  in  such  a 
way  that  a  suitable  set  of  closed-loop  characteristic  frequencies  is  obtained.  How  the 
suitability  of  such  a  set  is  judged  is  suggested  from  a  study  of  the  implications  of 
Fig. 17.  The  set  of  closed  loop  characteristic  frequencies  associated  with  Fig. 17(a) 
depends  on  the  nature  of  the  characteristic  gain  loci  for  the  transmittance  between  v 
and  c  in  Fig. 17(b).  Hence,  the  manipulation  of  the  closed-loop  frequencies  for  the 
arrangement  of  Fig. 17(a)  is  equivalent  to  the  manipulation  of  the  characteristic  gain 
loci  for  the  compensated  plant  of  Fig. 18. 


Design  of  Gain  Injector:  Assume  that  the  first  stage  of  the  design  procedure 
has  been  carried  out:  one  is  then  considering  the  second-stage  situation  shown  in 

Fig.  19  where  K(s)  is  the  transfer-function  matrix  of  the  gain  injector  and  Gc(s)  is  the 
transfer- function  matrix  of  the  compensated  plant. 

Let 

Q(s)  =  Gc(s)K(s)  (11.1) 


T 

i 
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so  that  the  closed-loop  transfer-function  matrix  for  this  arrangement  is 

Rts)  =  [I  +  Q ( s) ]-1Q (s) .  (11.2) 

m 

It  follows  from  standard  relationships  in  matrix  algebra  that  every  eigenvector  w,  (s)  of 
Q ( s )  is  also  an  eigenvector  of  R(s),  and  that  for  every  eigenvalue  q,  (s)  of  Q(s)  there  is 
a  corresponding  eigenvalue  q,(s)/[l  +  q, (s)Jof  R(s).  Since  the  set1of  complex  numbers 
{q, (1  +  q  )  :  i  =  l,2,...,mj  will  have  two  or  more  members  with  the  same  value  if  and 

only  if  trie  set  of  complex  numbers  (q.  :  1  =  l,2,...,m)  has  two  or  more  members  with  the 
same  value,  it  follows  that  the  set  of  branch  points  associated  with  the  matrices  Rts) 
and  Q(s)  are  identical.  The  characteristic  gain  algebraic  functions  associated  with  the 
open-loop  matrix  Q(s)  and  the  closed-loop  matrix  R(s)  thus  share  the  same  Riemann 
surface,  and  these  matrices  also  share  the  same  set  of  eigenvectors.  Therefore,  if  the 
open-loop  matrix  Q(s)  has  a  dyadic  expansion  of  the  form 


Q(s> 


m 

l 

i  =  l 


qi  (s)wi  (s)  v^  (s) 


(11.3) 


valid  for  almost  all  values  of  s  (that  is,  except  at  the  branch  points) , then  R(s)  will 
have  a  corresponding  dyadic  expansion  of  the  form 


R(s) 
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i=l 


'q^s) 

T+q^TsT 


wi (s) vx 


(11.4) 


valid  in  the  same  punctured  plane  as  Q(s) (that  is,  the  complex  plane  with  the  set  of 
branch  points  deleted) . 


Equation  (11.4)  can  be  made  the  basis  of  a  design  technique  for  systems  having 
the  same  number  of  inputs  and  outputs.  The  closed-loop  frequency  response  is  given  by 


R(jw) 
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wi  ( j  u)  v1  (  ju) 


(11.5) 


If 


q i  ( j tu )  -»«■  ,  i  =  1 , 2  , .  . .  ,m, 

then 

m  t 

R(]»u)  -  I  w(  ju)  v.  (jo)  =  I  , 
i  =  l  1  m 


where  lm  is  a  unit  matrix  of  order  m,  so  that  good  closed-loop  performance  may  be  achiev¬ 
ed  by  making  all  the  characteristic  gains  of  Q(s)  sufficiently  high  over  a  required  operat¬ 
ing  frequency  range.  The  amounts  of  characteristic  gain  required  for  a  given  quality  of 
closed-loop  performance,  and  the  amount  of  closed- loop  interaction  which  will  result, 
depend  on  the  behaviour  with  frequency  of  the  eigenvectors  of  Q(s).  The  desired  propert¬ 
ies  of  the  characteristic  gain  loci  and  characteristic  gain  directions  (eigenvectors)of 
G  (s)K(s)  are  therefore  determined  by  the  following  considerations. 

1)  The  characteristic  gain  loci  (generalised  Nyquist  diagrams)  of  G  (s)K(s) 

must  satisfy  the  generalised  Nyquist  stability  criterion.  c 

2)  For  high  tracking  performance,  and  therefore  low  interaction,  at  any  stipul¬ 
ated  frequency,  the  gains  of  all  the  characteristic  loci  must  be  suitably  large.  This  is 
particularly  relevant  to  the  low-frequency  situation  in  which  it  is  feasible  to  have  high 
characteristic  gains. 

3)  At  high  frequencies,  however,  it  is  not  generally  feasible,  because  of  the 
excessive  power  requirements  involved,  to  have  large  gains  in  the  characteristic  loci. 

Such  high  gains  at  high  frequencies  would,  in  many  cases,  also  tend  to  violate  the 
generalised  Nyquist  stability  criterion.  To  reduce  interaction  at  high  frequencies, 
therefore,  one  cannot  in  general  deploy  high  gains  for  this  purpose  and  so  must  instead 
make  the  characteristic  direction  set  of  the  open-loop  transference  align  with  the 
standard  basis  vector  set  at  high  frequencies. 

The  principal  components  of  a  design  approach  exploiting  these  ideas  are, 
therefore, 


a)  a  method  of  manipulating  characteristic  gain  loci,  and 

b)  a  method  of  manipulating  characteristic  gain  directions. 


Manipulation  of  Characteristic  Gains  and  Directions:  From  the  spectral 

analysis  point  of  view  the  essential  difficulty  in  choosing  an  appropriate  form  cf  con¬ 
troller  matrix  K(s)  lies  in  the  fact  that  very  little  is  known  of  the  way  in  which  the 
eigenvalues  and  eigenvectors  of  the  product  of  two  matrices  G  (s)  and  K(s)  are  related 
to  the  individual  eigenvalues  and  eigenvectors  of  G  (s)  and  KTs) .  There  is  one 
exception  to  this  situation,  however,  and  this  is  that  in  which  the  matrices  concerned 
commute,  that  is,  when 


Gc(s) K(s)  =  K(s)Gc<s) . 
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This  happens  if  and  only  if  G  (s)  and  K(s)  have  a  common  set  of  eigenvectors.  In  such 
a  situation  the  eigenvalues  of  the  product  Gc(s)K(s)  are  simply  appropriate  products  of 
the  individual  eigenvalues  of  the  matrices  G^(s)  and  K(s);  the  eigenvalues  which  are 
multiplied  together  are  those  associated  with  a  common  eigenvector. 

Since  the  equation 

det  [glm  -  Gc(s)  ]=  0 

will  normally  be  irreducible,  the  eigenvalues  and  eigenvectors  of  the  matrix  G  (s)  will 
not  be  expressible  in  terms  of  rational  functions.  It  will  therefore  not  normally  be 
feasible  to  construct  a  realizable  controller  K(s)  which  will  exactly  commute  with  G  (s) 
for  all  values  of  s.  Even  in  the  rare  cases  when  this  might  be  possible,  such  a  c 
controller  could  be  unnecessarily  complicated.  A  more  practical  and  rewarding  approach 
is  to  investigate  the  possibility  of  using  controllers  which  are  approximately  commuta¬ 
tive  with  the  plant  at  appropriately  chosen  values  of  frequency.  In  one  approach  of 
this  sort  (MacFarlane  and  Kouvaritakis  (501) the  controller  matrix  is  chosen  to  have  the 
specific  form 

K  ( s )  =  H  ( s )  J 

where  H  and  J  are  matrices  with  real  elements  and  A^(s)  is  a  diagonal  matrix  of 
rational  functions  in  s.  The  matrix  K(s)  can  therefore  be  expressed  in  the  dyadic 
form 

m  t 

K ( s )  =  Z  k(s)h.j 

j=l 

where  the  vector  sets 

{hj  :  i  =  1,2,... ,m}  and  ( j ^  :  i  =  1,2,. . . ,m) 

are  both  real.  An  approximately-commutative  controller  is  one  whose  constituent  real 
matrices  H  and  J  are  chosen  in  such  a  way  that  K(s)  is  approximately  commutative  with 
G  (s)  at  some  specific  value  of  complex  frequency  §.  At  such  a  frequency  one  has  that 

qi(i)=gi(s)ki(s),  i  =  1,2,.  .  .  ,  m 

where  q^(s),g,(s)  are  eigenvalues  of  Q(s)  and  Gc(s),  respectively,  and  k.(s)  are  appro¬ 
priate  diagonal  elements  of  A.  f s ) .  This  set  of  relationships  gives  an  obvious  basis 
for  manipulating  the  characteristic  gain  loci  of  Q(s)  in  a  systematic  manner;  it  also 
turns  out  that  this  approximately  commutative  technique  can  be  used  to  manipulate  the 
eigenvectors  of  Q(s)  .  Such  manipulation  techniques  nave  been  described  by  MacFarlane 
and  Kouvaritakis  [501  . 

Design  of  the  Inner-Loop  Compensator:  The  ideas  behind  the  design  of  the 

inner-loop  compensator  are  illustrated  by  Figs. 20-22.  The  form  of  the  characteristic 
gain  loci  for  the  compensated  plant  G  (s),  that  is,  for  the  transmittance  between  v  and 
c  in  Fig. 17(b)  ,  is  directly  related  tS  the  nature  of  the  closed-loop  characteristic 
frequencies  for  the  arrangement  of  Fig. 17(a),  which  in  turn  is  equivalent  to  Figs. 20  and 
21.  Thus  the  problem  of  choosing  the  component  blocks  F  and  F  of  Fig. 20  for  the 
compensator  is  essentially  the  problem  of  choosing  the  feeSback  matrix  F  in  the  con¬ 
figuration  of  Fig. 21  in  such  a  way  that  a  suitable  set  of  closed-loop  characteristic 
frequencies  is  obtained.  Since  F  has  m  inputs  and  l  outputs,  where  £<m,  the  choice  of 
F  is  often  referred  to  as  a  "squaring-down  procedure".  The  choice  of  F  is  essentially 
based  on  generalised  root-locus  ideas  and  involves  the  placement  of  finite  zeros  and  the 
manipulation  of  root-locus  asymptotes  (MacFarlane,  Kouvaritakis,  and  Edmunds  [10]).  The 
final  objective  of  the  "inner-loop  compensator"  design  is  to  get  suitable  locations  for 
the  closed-loop  poles  seen  by  the  gain  injector  K(s)  in  Fig. 22.  A  satisfactory  achieve¬ 
ment  of  this  objective  requires  some  dynamical  insight  and  engineering  judgment  since, 
in  the  usual  case  when  all  the  plant  states  are  not  directly  accessible,  it  will  not 
normally  be  possible  to  place  finite  zeros  in  arbitrarily  specified  locations  in  the 
complex  plane.  Nevertheless,  the  basic  ideas  underlying  the  technique  of  inner-loop 
compensator  design  are  quite  straightforward;  one  places  finite  zeros  and  manipulates 
root-locus  asymptote  patterns  in  such  a  way  that,  by  setting  a  gain  parameter  at  a 
suitable  value,  the  closed-loop  poles  of  the  system  "seen  by"  the  gain-injector  K(s) 
are  pulled  into  suitable  locations  in  the  complex  plane.  In  some  circumstances  one  may 
wish  to  pull  such  closed-loop  poles  well  over  into  the  left-half  complex  plane  in  order 
to  speed  up  closed-loop  response  and  to  allow  K(s)  to  inject  a  desired  amount  of  gain 
before  the  overall  (inner  plus  outer  loops)  system  has  its  closed-loop  characteristic 
frequencies  driven  into  unacceptable  regions  of  the  complex  plane.  In  other  circumstances 
one  may  wish  to  "freeze"  the  location  of  major  poles  associated  with  certain  open-loop 
plant  characteristic  frequencies  by  placing  finite  zeros  near  them.  Often  a  (judicious 
mixture  of  both  approaches  will  be  required. 

Although  it  requires  engineering  insight  and  manipulative  skill,  it  has  been 
shown  to  be  a  powerful  tool  in  the  hands  of  a  skilled  designer.  A  fairly  extensive 
computer-aided  design  package  has  been  developed  for  use  in  this  way  [49]  ,[51]  . 
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11.2  Least-squares  Nyquist/Bode  Array  Approach. 

An  alternative  approach  has  been  developed  by  Edmunds  I  52  ]  which  transfers  the 
burden  of  the  designer's  choice  to  the  selection  of  a  closed-loop  performance  specifica¬ 
tion  and  a  controller  structure.  This  approach  uses  Nyquist  and  Bode  diagram  arrays  of 
the  open-loop  and  closed-loop  behaviour  as  a  means  of  assessing  performance  and  inter¬ 
action.  Following  a  preliminary  analysis  of  the  problem  (using  the  techniques  of 
Section  11.1  above)  the  designer  chooses  a  suitable  specification  for  the  closed- loop 
behaviour  of  the  feedback  system  together  with  a  stipulated  structure  for  the  feedback 
controller  to  be  used.  The  controller  structure  selected  has  a  number  of  variable 
parameters  and  the  design  is  completed  by  a  computer  optimization  (using  say  a  standard 
weighted  least-squares  algorithm)  of  the  difference  between  the  specified  and  actual 
closed-loop  performances.  As  one  would  expect  this  method,  when  properly  used,  produces 
better  results  than  that  of  Section  11.1  above.  It  has  the  further  advantage  of  being 
extendable  to  deal  with  variations  in  the  system  model  parameters.  The  CAD  package 
mentioned  above  [49]  ,  [511  incorporates  the  routines  for  dealing  with  this  approach. 

11.3  Example  of  frequency-response  design  approach. 

Fig. 23  shows  the  Nyquist  array  for  a  2-input,  2  out-put,  12-state  automotive  gas 
turbine.  Since  all  the  elements  of  this  array  are  of  the  same  order  of  magnitude,  out¬ 
put  scaling  as  an  initial  step  in  the  design  procedure  is  not  required.  The  character¬ 
istic  loci  (generalised  Nyquist  diagrams)  for  this  system  are  shown  in  Fig. 24.  An 
inspection  of  these  immediately  shows  that  no  serious  design  difficulties  should  arise; 
the  speed  with  which  such  an  initial  assessment  can  be  carried  out  is  one  of  the  great 
advantages  of  the  generalised  frequency-response  approach.  The  next  step  in  the 
procedure  for  this  example  was  to  carry  out  a  ‘high  frequency  alignment'  (2]  at  a 
frequency  of  3  rad/sec.  The  frequency  used  is  usually  that  corresponding  to  the  target 
closed-loop  bandwidth.  Figs.  25  and  26  show  the  generalised  Nyquist  and  Bode  plots  of 
the  system's  characteristic  gain  functions  after  this  alignment  step.  An  inspection  of 
these  Bode  plots  shows  that  one  of  the  characteristic  gains  is  fairly  'flat'  near  the 
critical  frequency,  so  some  extra  phase  compensation  (s+5)/(s+l)  is  next  added  to  this 
locus  using  an  ' approximately  commutative  controller'  [2)  -alculated  at  a  frequency  of 
3  rad/sec.  Fig.  27  shows  the  'misalignment  angles'  [2!  following  the  high-frequency 
alignment  procedure.  It  can  be  seen  that  the  misalighment  angles  have  been  suitably  re¬ 
duced  except  for  frequencies  in  the  region  of  1.5  to  2  rad/sec. 

Fig. 28  shows  Bode  plots  of  the  characteristic  gains  with  an  alignment  at  3 
rad/sec.  followed  by  the  approximately  commutative  controller  at  3  rad/sec. ,  followed 
by  a  further  alignment  at  3  rad/sec.  It  can  be  seen  that  the  gains  are  now  reasonably 
well  balanced.  To  complete  the  design  extra  compensation  is  now  added  for  low  and 
medium  frequencies.  A  proportional-plus-integral  control  term  (s+l)/s  is  chosen  for 
the  lower-gain  locus  at  lower  frequencies.  The  higher-gain  locus  needs  some  extra 
phase  advance  at  frequencies  just  below  1  rad/sec.  and  hence  a  term  of  (  (s+0.2)  (s+0.4)  )/ 
fs(s+1.5)l  is  used  for  it.  This  compensation  is  injected  via  an  'approximately 
commutative  controller'  at  a  frequency  of  0.7  rad/sec. 

The  generalised  Nyquist  and  Bode  diagrams  for  the  final  compensated  system 
(with  an  align  at  3,  a  commutative  controller  at  3,  a  further  align  at  3,  and  a  final 
commutative  controller  at  0.7)  are  shown  in  Figs.  29  and  30.  These  show  good  stability 
margins  and  gains  which  are  well-balanced  over  a  wide  range  of  frequencies.  Fig. 31 
shows  the  closed- loop  gain  array  (in  Bode  form);  from  this  we  see  that  the  closed-loop 
system  has  a  bandwidth  of  about  10  rad/sec.  in  the  first  loop  and  of  about  3  rad/sec.  in 
the  second  loop.  Fig.  32  shows  the  magnitudes  of  the  principal  gains  of  the  closed- 
loop  system  as  a  function  of  frequency.  These  show  that  the  system  will  be  reasonably 
robust  against  multiplicative  perturbations  in  the  open-loop  transfer-function. 
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SUMMARY 

This  paper  uses  singular  value  analysis  to  generalize  the  fundamental  feedback  ideas 
from  classical  single  loop  control  theory  to  multiloop  systems.  The  classical  view  of 
measuring  the  benefit  of  feedback  in  terms  of  desensitizing  the  system  to  plant 
variations  and  disturbances  is  discussed.  Uncertainty  is  shown  to  play  a  critical  role 
in  determining  the  way  in  which  feedback  may  be  used.  Certain  singular  value  plots, 
called  o-plots,  are  introduced  as  natural  and  effective  generalizations  of  Bode  gain 
plots  and  form  the  basic  tools  for  analysis  of  multiloop  feedback  systems.  These  tools 
provide  reliable  means  for  assessing  the  stability  margins,  bandwidth,  and 
desensitizing  effects  of  multiloop  feedback  systems.  Examples  are  given  to  illustrate 
the  use  of  the  o-plots  and  their  interpretations.  Most  of  this  paper  is  excerpted 
from  other  papers  already  published  ((1)  -  (41). 


I.  Introduction 

In  designing  control  systems,  there  are  fundamental  issues  that  transcend  the 
boundaries  of  specific  applications.  Although  they  may  be  packaged  slightly 
differently  for  each  application  and  may  have  different  levels  of  importance,  these 
issues  are  generic  in  their  relationship  to  control  design  objectives  and  procedures. 
Central  to  these  issues  is  the  control  requirement  to  provide  satisfactory  performance 
in  the  face  of  system  variations  and  uncertainty.  This  requirement  was  the  original 
motivation  for  the  development  of  feedback  systems.  Feedback  is  only  required  when 
system  performance  cannot  be  achieved  because  of  uncertainty  in  system 
characteristics.  This  paper  is  based  on  the  premise  that  the  recognition  and  proper 
treatment  of  uncertainty  holds  the  key  to  viable  solutions  for  essentially  all  feedback 
design  problems.  Section  II  qualitatively  discusses  the  relationship  between  design 
models  and  the  real  world.  This  includes  how  the  use  of  models,  with  their  associated 
uncertainties,  impact  the  control  design  process. 

Section  III  reviews  the  fundamental  practical  issue  in  feedback  design  --  namely,  how 
to  achieve  the  benefits  of  feedback  in  the  face  of  uncertainties.  The  nominal  design 
models  used  are  assumed  to  be  finite  dimensional,  linear  and  time  variant  (FDLTI)  and 
the  approach  is  basically  classical  ((5]  -  [6])  expressed  in  terms  of  transfer 
functions  in  the  frequency  domain.  Singular  values  are  used  to  generalize  the 
classical  single-input  single-output  (SISO)  results  to  multi-input  multi-output  (MIMO) 
systems. 

In  Section  IV  the  problem  of  representing  uncertainty  in  the  design  model  is  discussed 
in  more  detail.  Various  types  of  uncertainties  which  can  arise  in  physical  systems  are 
briefly  described  and  "unstructured  uncertainties"  are  singled  out  as  generic  errors 
which  are  associated  with  all  design  models.  Section  V  then  shows  how  classical  SISO 
statements  of  the  feedback  design  problem  in  the  face  of  unstructured  uncertainty  can 
be  reliably  generalized  to  MIMO  system.  Section  VI  presents  some  examples  to 
illustrate  the  concepts  discussed  in  the  preceding  sections.  Conclusions  follow  in  the 
last  Section  VIII. 

II.  Modeling  and  Uncertainty 

Most  control  designs  are  based  on  the  use  of  a  design  model.  The  relationship  between 
models  and  the  reality  they  represent  is  subtle  and  complex.  Thus,  the  problems 
created  by  model  uncertainty  have  often  been  either  trivialized  or  ignored  in 
theoretical  studies  in  favor  of  assuming  the  alternative  of  no  distinction  between 
models  and  reality. 

A  mathematical  model  provides  a  map  from  inputs  to  responses.  The  quality  of  a  model 
depends  on  how  closely  its  responses  match  those  of  the  true  plant.  Since  no  single 
fixed  model  can  respond  exactly  like  the  true  plant,  we  need,  at  the  very  least,  a  set 
of  maps.  However,  the  modeling  problem  is  much  deeper  --  the  universe  of  mathematical 
models  from  which  a  model  set  is  chosen  is  distinct  from  the  universe  of  physical 
systems.  Therefore,  a  model  set  can  never  be  constructed  which  includes  the  true 
physical  plant.  It  is  necessary  for  the  engineer  to  make  a  leap  of  faith  regarding  the 
applicability  of  a  particular  design  based  on  a  mathematical  model.  To  be  practical,  a 


design  technique  must  help  make  this  leap  small  by  accounting  for  the  inevitable 
inadequacy  of  models.  A  qood  model  should  be  simple  so  as  to  facilitate  design,  yet 
complex  enough  to  give  the  engineer  confidence  that  designs  based  on  the  model  will 
work  on  the  true  plant. 

The  term  uncertainty  refers  to  the  differences  or  errors  between  models  and  reality  and 
whatever  mechanism  is  used  to  express  these  errors  will  be  called  a  representation  of 
uncertainty.  For  example,  consider  the  problem  of  bounding  the  magnitude  of  the  effect 
of  some  uncertainty  on  the  output  of  a  nominally  fixed  linear  system.  A  useful  measure 
of  uncertainty  in  this  context  is  to  provide  a  bound  on  the  spectrum  of  the  deviation 
of  the  output  from  its  nominal  response.  In  the  simplest  case,  this  spectrum  is 
assumed  to  be  independent  of  the  input.  This  is  equivalent  to  assuming  that  the 
uncertainty  is  generated  by  an  additive  noise  signal  with  bounded  spectrum;  the 
uncertainty  is  represented  as  additive  noise.  Of  course,  no  physical  system  is  linear 
with  additive  noise,  but  some  aspects  of  physical  behavior  are  approximated  quite  well 
using  this  model.  This  type  of  uncertainty  has  received  a  great  deal  of  attention  in 
the  literature,  perhaps  more  because  it  yields  elegant  theoretical  solutions  (e.g., 
white  noise  propagation  in  linear  systems,  Wiener  and  Kalman  filtering,  LQG)  than 
because  it  is  of  greater  practical  significance  than  other  types  of  uncertainty. 

Generally,  the  spectrum  of  the  deviation  of  the  true  output  from  the  nominal  will 
depend  significantly  on  the  input.  For  example,  an  additive  noise  model  is  entirely 
inappropriate  for  capturing  uncertainty  arising  from  variations  in  the  material 
properties  of  physical  plants.  The  actual  construction  of  model  sets  for  more  general 
uncertainty  can  be  quite  difficult.  Suppose  we  begin  with  a  differential  equation  ror 
our  model.  We  immediately  recognize  the  need  to  parametrize  the  differential  equation 
to  reflect  plant  variations  relative  to  our  model.  However,  for  certain  classes  of 
signals  (e.g.,  high  frequency)  the  parametrized  differential  equation  fails  to  describe 
the  plant,  because  the  plant  will  always  have  dynamics  which  are  not  represented  in  the 
differential  equation. 

In  general,  we  are  forced  to  use  model  sets  that  allow  for  plant  dynamics  which  are  not 
explicitly  represented  in  the  model  structure.  A  simple  example  of  this  involves  using 
frequency-domain  bounds  on  transfer  functions  to  describe  a  model  set.  To  use  such 
sets  to  describe  physical  systems,  the  bounds  must  roughly  grow  with  frequency.  In 
particular,  at  sufficiently  high  frequencies,  phase  is  completely  unknown,  i.e., 

+180°  uncertainties.  This  is  a  consequence  of  dynamic  properties  which  inevitably 
occur  in  physical  systems.  These  issues  will  be  discussed  further  in  Section  IV. 

Assuming  we  are  given  a  model  including  a  representation  of  uncertainty,  which  we 
believe  adequately  captures  the  essential  features  of  the  plant,  the  next  step  in  the 
controller  design  problem  is  to  determine  what  structure  is  necessary  to  achieve  the 
desired  performance.  Prefiltering  of  input  signals  can  change  the  dynamic  response  of 
the  model  set  but  cannot  reduce  the  effect  of  uncertainty.  If  the  uncertainty  is  too 
great  to  achieve  the  desired  accuracy  of  response,  then  a  feedback  structure  is 
required.  The  mere  assumption  of  a  feedback  structure,  however,  does  not  guarantee  a 
reduction  of  uncertainty,  and  there  are  many  obstacles  to  achieving  the  uncertainty- 
reducing  benefits  of  feedback. 

In  particular,  since  for  any  reasonable  model  set  representing  a  physical  system, 
uncertainty  becomes  large  and  phase  is  completely  unknown  at  sufficiently  high 
frequencies,  the  loop  gain  must  be  small  at  those  frequencies  to  avoid  destabilizing 
the  high  frequency  system  dynamics.  Even  worse  is  that  the  feedback  system  actually 
increases  uncertainty  and  sensitivity  in  the  frequency  ranges  where  uncertainty  is 
sufficiently  large.  In  other  words,  because  of  the  type  of  sets  required  to  reasonably 
model  physical  systems  and  the  restriction  that  our  controllers  be  causal,  we  cannot 
use  feedback  (or  any  other  control  structure)  to  cause  our  closed-loop  model  set  to  be 
a  proper  subset  of  the  open-loop  model  set.  Often,  what  can  be  achieved  with 
intelligent  use  of  feedback  is  a  significant  reduction  of  uncertainty  for  certain 
signals  of  importance  with  a  small  increase  spread  over  other  signals.  Thus,  the 
feedback  design  problem  centers  around  the  tradeoff  involved  in  reducing  the  overall 
impact  of  uncertainty.  This  tradeoff  also  occurs  for  example,  when,  using  feedback  to 
reduce  command/disturbance  error  while  minimizing  response  degradation  due  to 
measurement  noise.  To  be  of  practical  value,  a  design  technique  must  provide  means  for 
performing  these  tradeoffs. 

III.  The  Benefits  of  Feedback 

We  will  deal  with  the  standard  feedback  configuration  illustrated  in  Figure  1: 
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Figure  1:  Standard  Feedback  Configuration 


It  consists  of  the  interconnected  plant  (G)  and  controller  (K)  forced  by  commands  (r), 
measurement  noise  ( n)  ,  and  disturbances  (d) .  The  dashed  precompensator  (P)  is  an 
optional  element  used  to  achieve  deliberate  command  shaping  or  to  represent  a  non-unity 
feedback  system  in  equivalent  unity  feedback  form.  All  disturbances  are  assumed  to  be 
reflected  to  the  measured  output  (y) ,  all  signals  are  multivariable,  in  general,  and 
both  nominal  mathematical  models  for  G  and  K  are  finite  dimensional  linear  time 
invariant  ( FDLTI )  systems  with  transfer  functions  matrices  G(s)  and  K(s).  Then  it  is 
well  known  that  the  configuration,  if  it  is  stable,  has  the  following  major  properties: 

(1)  Input-output  behavior 

y  =  GK  ( I  +  GK)'1  (r  -  n)  +  (I  +  GK)_1d  (1) 


=  (I  +  GK)*1  (r  -  d)  +  GK ( I  +  GKJ'in  (2) 

(2)  System  sensitivity  (7] 

AHcl  =  (I  +  G'KJ-ihHoi  (3) 

In  equation  (3),  AHci  and  AH0i  denote  changes  in  the  closed  loop  system  and 

changes  in  a  nominally  equivalent  open  loop  system,  respectively,  caused  by  changes  in 

the  plant  G,  i.e.,  G‘  =  G  +  AG. 

Equations  (1)  through  (3)  summarize  the  fundamental  benefits  and  design  objectives 
inherent  in  feedback  loops.  Specifically,  equation  (2)  shows  that  the  loop's  errors  in 
the  presence  of  commands  and  disturbances  can  be  made  "small"  by  making  the" sensi tivity 
operator,  or  inverse  return  difference  operator,  (I  +  GK)"1,  "small”,  and  equation 

(3)  shows  that  loop  sensitivity  is  improved  under  these  same  conditions,  provided  G' 
does  not  stray  too  far  from  G.  An  alternative  approach  which  measures  sensitivity  in 
terms  of  G  and  not  G'  is  given  in  the  companion  paper  [191. 

For  SISO  systems,  the  appropriate  notion  of  smallness  for  the  sensitivity  operator  is 
well  understood  -  namely,  we  require  that  the  complex  scalar  [1  +  g(jw)  Mjw)]"1 
have  small  magnitude,  or  equivalently  that  1  +  g(jto)  k(ju>)  have  large  magnitude, 
for  all  real  frequencies  u  where  the  commands,  disturbances  and/or  plant  changes, 

AG,  are  significant.  In  fact,  the  performance  objectives  of  SISO  feedback  systems 
are  commonly  stipulated  in  terms  of  explicit  inequalities  of  the  form 

ps(w)  <  il  +  g(jw)  k(]w)i  v- or  <  u0,  (4) 

where  ps(io)  is  a  (large)  positive  function  and  wQ  specified  the  active  frequency 
range . 

This  basic  idea  can  be  readily  extended  to  MIMO  problems  through  the  use  of  matrix 
norms.  Selecting  the  spectral  norm  as  our  measure  of  matrix  size,  for  example,  the 
corresponding  feedback  requirements  become 

~  [(I  +  G(jw)  K(jw))"1]  small 

or  conversely 

ps(u)  <_  0(1  +  G  (jtv)  K(ju))l  (5) 

for  the  necessary  range  of  frequencies.  The  symbols  7T  and  £  in  these  expressions 
are  defined  as  follows: 

-  !ft»  =  IUII--1  IMI  N 

o  [ A |  ,£]}.,  life  II  \  WA*A| 

where  l  l  •  l  l  is  the  usual  Euclidian  norm,  A [ • J  denotes  eigenvalues,  and  [•)* 
denotes  conjugate  transpose.  The  two  o's  are  called  maximum  and  minimum  singular 
values  of  A  (or  principal  gains  Ml),  respectively,  and  can  be  calculated  with 
available  linear  system  software. (8)  More  discussion  of  singular  values  and  their 
properties  can  be  found  in  various  texts. [9] 

Condition  (5)  on  the  return  difference  I  +  GK  can  be  interpreted  as  merely  a 
restatement  of  the  common  intuition  that  large  loop  gains  or  "tight”  loops  yield  good 
performance.  This  follows  from  the  inequalities 

0|GK)  ■  1  <  0(I+GK)  <  O  [  GK  ]  + 1  (8) 

which  show  that  return  difference  magnitudes  approximate  the  loop  gains,  o(GK] , 
whenever  these  are  large  compared  with  unity.  Evidently,  good  multivariable  feedback 
loop  design  boils  down  to  achieving  high  loop  gains  in  the  necessary  frequency  range. 

Despite  the  simplicity  of  this  last  statement,  it  it  clear  from  years  of  research  and 
design  activity  that  feedback  design  is  not  trivial.  This  is  true  because  loop  gains 
cannot  be  made  arbitrarily  high  over  arbitrarily  large  frequency  ranges.  Rather,  they 


(6) 

(7) 
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must  satisfy  certain  performance  tradeoffs  and  design  limitations.  A  major  performance 
tradeoff,  for  example,  concerns  command  and  disturbance  error  reduction  versus  sensor 
noise  error  reduction.  110)  The  conflict  between  these  two  objectives  is  evident  in 
equation  (2).  Large  o[GK(jui))  values  over  a  large  frequency  range  make  errors  due 
to  r  and  d  small.  However,  they  also  make  errors  due  to  n  large  because  this  noise 
is  "passed  through"  over  the  same  frequency  range,  i.e., 

y  =  GK(j<J)  (I+GK(ja))  |_1n  i  In  (9) 

Worse  still,  large  loop  gains  can  make  the  control  activity  (variable  u  in  Figure  1) 
quite  unacceptable.  This  follows  from 

u  =  K  1 I+GK] -l  (r-n-d)  i  G~  1  (  ju>)  (  r-n-d)  (10) 

Here  we  have  assumed  G  to  be  square  and  invertible  for  convenience.  The  resulting 
equation  shows  that  commands,  disturbances  and  sensor  noise  are  actually  amplified  at  u 
whenever  the  frequency  range  significantly  exceeds  the  bandwidth  of  G;  i.e.,  for  u 
such  that  cf  [ G  (  j 'o )  ]  <<  1  we  get 

~l G“ 1  ( ju>)  )  =  1  >>  1  (11) 

_o(G(io>)  ) 

One  of  the  major  contributions  of  modern  feedback  theory  is  the  development  of 
systematic  procedures  for  conducting  the  above  performance  tradeoffs.  We  are 
referring,  of  course,  to  the  LQG  theory  (11)  and  to  its  modern  Wiener-Hopf  frequency 
domain  counterpart.  [12)  Under  reasonable  assumptions  on  plant,  disturbances,  and 
performance  criteria,  these  procedures  yield  efficient  design  compromises.  In  fact,  if 
the  tradeoff  between  command/disturbance  error  reduction  and  sensor  noise  error 
reduction  were  the  only  constraint  on  feedback  design,  practitioners  would  have  little 
to  complain  about  with  respect  to  the  relevance  of  modern  theory.  The  problem  is  that 
these  performance  tradeoffs  are  often  overshadowed  by  a  second  limitation  on  high  loop 
gains  --  namely,  the  requirement  for  tolerance  to  uncertainties.  Though  a  controller 
may  be  designed  using  FDLTI  models,  the  design  must  be  implemented  and  operate  with  a 
real  physical  plant.  The  properties  of  physical  systems,  in  particular  the  ways  in 
which  they  deviate  from  finite-dimensional  linear  models,  put  strict  limitations  on  the 
frequency  range  over  which  the  loop  gains  may  be  large.  In  order  to  properly  motivate 
these  restrictions,  we  digress  in  Section  3  to  a  brief  description  of  the  types  of 
system  uncertainties  most  frequently  encountered.  The  manner  in  which  these 
uncertainties  can  be  accounted  for  in  MIMO  design  then  forms  the  basis  for  the  rest  of 
the  paper. 

IV.  The  Nature  of  Uncertainty 

While  no  nominal  design  model,  G(s),  can  emulate  a  physical  plant  perfectly,  it  is 
clear  that  some  models  do  so  with  greater  fidelity  than  others.  Hence,  no  nominal 
model  should  be  considered  complete  without  some  assessment  of  its  errors.  We  will 
call  these  errors  the  "model  uncertainties”,  and  whatever  mechanism  is  used  to  express 
them  will  be  called  a  "representation  of  uncertainty.” 

Representations  of  uncertainty  vary  primarily  in  terms  of  the  amount  of  structure  the/ 
contain.  This  reflects  both  our  knowledge  of  the  physical  mechanisms  which  cause 
differences  between  model  and  plant  and  our  ability  to  represent  these  mechanisms  in  a 
way  that  facilitates  convenient  manipulation.  For  example,  a  set  membership  statement 
for  the  parameters  of  an  otherwise  known  FDLTI  model  is  a  highly-structured 
representat ■ in  of  uncertainty.  It  typically  arises  from  the  use  of  linear  incremental 
models  at  various  operating  points,  e.g.,  aerodynamic  coefficients  in  flight  control 
vary  with  flight  environment  and  aircraft  configurations,  and  equation  coefficients  in 
power  plant  control  vary  with  aging,  slag  buildup,  coal  composition,  etc.  In  each 
case,  the  amounts  of  variation  and  any  known  relationships  between  parameters  can  be 
expressed  by  confining  the  parameters  to  appropriately  defined  subsets  of  parameter 
space.  A  specific  example  of  such  a  parameterization  for  the  F-8C  aircraft  is  given  in 
113). 

Examples  of  less-structured  representations  of  uncertainty  are  direct  set  membership 
statements  for  the  transfer  function  matrix  of  the  model.  For  instance,  the  statement 

G'  ()<*))  =  G  ( j  in)  +  AG(jw) 


with 


n  |  AG  (  j u> )  ]  <  j2a  (r.i)  -V-  <0  >o  (12) 

wherejJa(")  is  a  positive  scalar  function,  confines  the  matrix  G'  to  a  neighborhood 
of  G  with  magnitude _£a (w) .  The  statement  does  not  imply  a  mechanism  or  structure 
which  qives  rise  to  AG.  The  uncertainty  may  be  caused  by  parameter  changes,  as 
above,  or  oy  neglected  dynamics,  or  by  a  host  of  other  unspecified  effects.  An 
alternative  statement  to  (12)  is  the  so-called  mul t ipl ica t i ve  form: 

G’(jw)  =  |I+L()r.i)  |G(  jl.l> 


with 
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CT !  L  ( ja>)  )  <  Jlm  (w)  -Vu)  >  o  (13) 

This  statement  confines  G'  to  a  normalized  neighborhood  of  G.  An  advantage  of  (13)  over 

(12)  is  that  in  (13)  compensated  transfer  functions  have  the  same  uncertainty 
representation  as  the  raw  model  (i.e.,  the  bound  (13)  applied  to  GK  as  well  as  to  G) . 
Still  other  alternative  set  membership  statements  are  the  inverse  forms  of  (12)  and 

(13)  which  confine  (G')"^  to  direct  or  normalized  neighborhoods  about  G"l. 

The  best  choice  of  uncertainty  representation  for  a  specific  FDLTI  model  depends,  of 
course,  on  the  errors  the  model  makes.  In  practice,  it  is  generally  possible  to 
represent  some  of  these  errors  in  a  highly-structured  parameterized  form.  These  are 
usually  the  low  frequency  error  components.  There  are  always  remaining  higher 
frequency  errors,  however,  which  cannot  be  covered  this  way.  These  are  caused  by  such 
effects  as  inf inite-dimensional  electro-mechanical  resonances  (16,  17],  time  delays, 
diffusion  processes,  etc.  Fortunately,  the  less-structured  representations,  (12)  or 
(13),  are  well  suited  to  represent  this  latter  class  of  errors.  Consequently, 

(12) -(13)  have  become  widely  used  "generic"  uncertainty  representations  for  FDLTI 
models. 

Motivated  by  these  observations,  we  will  focus  throughout  the  rest  of  this  paper 
exclusively  on  the  effects  of  uncertainties  as  represented  by  (13).  For  lack  of  a 
better  name,  we  will  refer  to  these  uncertainties  simply  as  "unstructured."  We  will 
assume  that  G'  in  (13)  remains  a  strictly  proper  FDLTI  system  and  that  G'  has  the  same 
number  of  unstable  modes  as  G.  The  unstable  modes  of  G*  and  G  do  not  need  to  be 
identical,  however,  and  hence  L(s)  may  be  an  unstable  operator.  These  restricted 
assumptions  on  G'  make  exposition  easy.  More  general  perturbations  (e.g,  time  varying, 
infinite  dimensional,  nonlinear)  can  also  be  covered  by  the  bounds  in  (13)  provided 
they  are  given  appropriate  "conic  sector"  interpretations  via  Parseval's  theorem.  This 
connection  is  developed  in  (14,  15]  and  will  not  be  pursued  here. 

When  used  to  represent  the  various  high  frequency  mechanisms  mentioned  above,  the 
bounding  functions  Jfm(w)  in  (13)  commonly  have  the  properties  illustrated  in  Figure 
2.  They  are  small  (<<1)  at  low  frequencies  and  increase  to  unity  and  above  at  higher 
frequencies.  The  growth  with  frequency  inevitably  occurs  because  phase  uncertainties 
eventually  exceed  +  180  degrees  and  magnitude  deviations  eventually  exceed  the  nominal 
transfer  function  magnitudes.  Readers  who  are  skeptical  about  this  reality  are 
encouraged  to  try  a  few  experiments  with  physical  devices. 
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It  should  also  be  noted  that  the  representat ion  of  uncertainty  in  (13)  can  be  used  to 
include  perturbation  effects  that  are  in  fact  not  at  all  certain  A  nonlinear  element, 
for  example,  may  be  quite  accurately  modeled,  but  because  our  design  techniques  cannot 
deal  with  the  nonlinearity  effectively,  it  is  treated  as  a  conic  linearity.  (14,  15) 

As  another  example,  we  may  deliberately  choose  to  iqnore  various  known  dynamic 
characteristics  in  order  to  achieve  a  simpler  nominal  design  model. 

Another  important  point  is  that  the  construction  of  jJm(u))  for  multivariable  systems 
is  not  trivial.  The  bound  assumes  a  single  worst  case  uncertainty  magnitude  applicable 
to  all  channels.  If  substantially  different  levels  of  uncertainty  exist  in  various 
channels,  it  may  be  necessary  to  scale  the  input-output  variables  and/or  apply 
frequency-dependent  transformations  (IS)  in  such  a  way  that  Jlm  becomes  more  uniformly 
tight.  These  scale  factors  and  transformations  are  here  assumed  to  be  part  of  the 
nominal  model  G(s). 

V.  Feedback  Design  in  the  Face  of  Unstructured  Uncertainties 

Once  we  specify  a  design  model,  G(s) ,  and  accept  the  existence  of  unstructured 
uncertainties  in  the  form  (13),  the  feedback  design  problem  becomes  one  of  finding  a 
compensator  K(s)  such  that 


(i) 

(ii) 

(  ixi) 


the  nominal  feedback  system,  GK ( I  +  GK]"1,  is  stable; 

the  perturbed  system,  G'K[I  +  G'K]'1,  is  stable  for  all  possible  G1  allowed 
by  (13);  and 


performance  objectives  are  satisfied  for  all  possible  G‘  allowed  by  (13) 


All  three  of  these  requirements  can  be  interpreted  as  frequency  domain  conditions  on 
the  nominal  loop  transfer  matrix,  GK(s),  which  the  designer  must  attempt  to  satisfy. 


Stability  Conditions 


The  frequency  domain  conditions  for  Requirement  (i)  are,  of  course,  well  known.  In 
SISO  cases,  they  take  the  form  of  the  standard  Nyquist  Criterion,  [See  any  classical 
control  text.)  and  in  MIMO  cases,  they  in/olve  its  multivariable  generalization.  [18) 
Namely,  we  require  that  the  encirclement  count  of  the  map  det  [I  +  GK ( s ) ) ,  evaluated  on 
the  standard  Nyquist  D-contour,  be  equal  to  the  (negative)  number  of  unstable  open  loop 
modes  of  GK. 


Similarly,  for  Requirement  (ii)  the  number  of  encirclements  of  the  map  det  [I  +  G’K(s)] 
must  equal  the  (negative)  number  of  unstable  modes  of  G'K.  Under  our  assumptions  on 
G',  however,  this  number  is  the  same  as  that  of  GK.  Hence,  Requirement  (ii)  is 
satisfied  if  and  only  if  the  number  of  encirclements  of  det  [I  +  G'K(s))  remains 
unchanged  for  all  G'  allowed  by  (13).  This  is  assured  if  det  [I  +  G'K)  remains  nonzero 
as  G  is  warped  continuously  toward  G',  or  equivalently,  iff 

0  <  a(  +  [ I+eL ( s) )G(s)K(s) )  (14) 

for  all  o  £  e  £  1,  all  s  on  the  D-contour,  and  all  L(s)  satisfying  (13).  Since 
G'  vanishes  on  the  infinite  segment  of  the  D-contour,  and  assuming,  for  simplicity, 
that  the  contour  requires  no  indentations  along  the  ju-axis,  (If  indentations  are 
required,  (14)  and  (17)  must  hold  in  the  limit  for  all  s  on  the  indented  path  as  the 
radius  of  indentation  is  taken  to  zero)  equation  (14)  reduces  to  the  following 
equivalent  conditions: 


o  < 

0 

(I+G(  jw)K(  jw)  +  e  L(jw)  G  (  ja>)  K  (  j  co )  ] 

(15) 

for 

all 

o 

£  f.  <_  lf  o£a)<a>,  and  all  L 

<£=> 

0  < 

a 

[I  +  L  GK ( I+GK) _1) 

(16) 

for 

all 

o 

£(i)<«,  and  all  L 

** 

v  1 

GK  i 

(  I +GK ) “ 1 1  <  1/  m (w) 

(17) 

for 

all 

0 

<«*»<«» 

The  last  of  these  equations  is  the  MIMO  generalization  of  the  familiar  SISO  requirement 
that  loop  gains  be  small  whenever  the  magnitude  of  unstructured  uncertainties  is 
large.  In  fact,  whenever j£m  (w)  >>  1,  we  get  the  following  constraint  on  GK : 

TT  [  GK  (  j cu )  I  <  1/J?m  (u>)  (18) 

for  all  fn  such  that  j2m(oj)  >>  1. 

Note  that  these  are  not  conservative  stability  conditions.  On  the  contrary,  if  the 
uncertainties  are  truly  unstructured  and  (17)  is  violated,  then  there  exists  a 
perturbation  L(s)  within  the  set  allowed  by  (13)  for  which  the  system  is  unstable. 
Hence,  these  stability  conditions  impose  hard  limits  on  the  permissible  loop  gains  of 
practical  feedback  systems. 
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Performance  Conditions 

Frequency  domain  conditions  for  Requirement  (iii)  have  already  been  described  in 
Section  2,  equation  (5).  The  modification  needed  to  account  for  unstructured 
uncertainties  is  to  apply  (5)  to  G‘  instead  of  G;  i.e., 

ps  <  o  [ 1+ ( I+L) GK | 

<=  ps  <  o  ( I +-L  GKd+GK)"1]  a  [I+GK] 

4—  PS((J)  <  £  iGK(ju)  |  (19) 

for  all  u>  sucn  that  fm(ui)  <  1  and  i  (GK(ju)]  >>  1. 

This  is  the  MIMO  generalization  of  another  familiar  SISO  design  rule  --  namely  that 
performance  objectives  can  be  met  in  the  face  of  unstructured  uncertainties  if  the 
nominal  loop  gains  are  made  sufficiently  large  to  compensate  for  model  variations. 

Note,  however,  that  finite  solutions  exist  only  in  the  frequency  range  where 

1. 

The  stability  and  performance  conditions  derived  above  illustrate  that  MIMO  feedback 
design  problems  do  not  differ  fundamentally  from  their  SISO  counterparts.  In  both 
cases,  stability  must  be  achieved  nominally  and  assured  for  all  perturbations  by 
satisfying  conditions  (17-18).  Performance  may  then  be  optimized  by  satisfying 
condition  (19)  as  well  as  possible.  What  distinguishes  MIMO  from  SISO  design 
conditions  are  the  functions  used  to  express  transfer  function  “size.'1  Singular  values 
replace  absolute  values.  The  underlying  concepts  remain  the  same. 

We  note  that  the  singular  value  functions  used  in  our  statements  of  design  conditions 
play  a  design  role  much  like  classical  Bode  plots.  The  o[I  +  GK 1  function  in  (5)  is 
the  minimum  return  difference  magnitude  of  the  closed  loop  system,  n[GK)  in  (8)  and 
o [GK ]  in  (18)  are  minimum  and  maximum  loop  gains,  and  n[GK(I  +  GK)"r]  in  (17)  is 
the  maximum  closed  loop  frequency  response.  These  can  all  be  plotted  as  ordinary 
frequency  dependent  functions  in  order  to  display  and  analyze  the  features  of  a 
multivariable  design.  Such  plots  will  here  be  called  o-plots. 

One  of  the  o-plots  which  is  particularly  significant  with  regard  to  design  for 
uncertainties  is  obtained  by  inverting  condition  (17),  i.e., 


(GK(I +GK) 


=  o  1 1+  (GK  (  jui)  )  -  1 1 


for  all  o  *  u>  <  ■».  The  function  on  the  right  hand  side  of  this  expression  is  an 
explicit  measure  of  the  degree  of  stability  (or  stability  robustness)  of  the  feedback 
system.  Stability  is  guaranteed  for  all  perturbations  L(s)  whose  maximum  singular 
values  fall  below  it.  This  can  iclude  gain  or  phase  changes  in  individual  output 
channels,  simultaneous  changes  in  several  channels,  and  various  other  kinds  of 
perturbations.  In  effect,  a(l  +  (GK)"1]  is  a  reliable  multivariable  generalization 
of  SISO  stability  margin  concepts  (e.g.,  frequency  dependent  gain  and  phase  margins). 
Unlike  the  SISO  case,  however,  it  is  important  to  note  that  o[I  +  (GK)*1)  measures 
tolerances  for  uncertainties  at  the  plant  outputs  only. 


Tolerances  for  uncertainties  at  the  input  are  generally  not  the  same.  They  can  be 
analyzed  with  equal  ease,  however,  by  using  the  function  o[I  +  (KG)"1]  instead  of 
0[I  +  (GK)"1]  in  (20).  This  can  be  readily  verified  by  evaluating  the  encirclement 
count  of  the  map  det  (I  +  KG)  under  perturbations  of  the  form  G'  =  G(I  +  L)  (i.e., 
uncertainties  reflected  to  the  input).  The  mathematical  steps  are  directly  analogous 
to  (15-18)  above.  Classical  designers  will  recognize,  of  course,  that  the  difference 
between  these  two  stability  robustness  measures  is  simply  that  each  uses  a  loop 
transfer  function  appropriate  for  the  loop-breaking  point  at  which  robustness  is  being 
tested.  The  relationship  between  input  and  output  margins  is  discussed  further  in  the 
companion  paper  [19]. 


The  feedback  desiqn  conditions  derived  above  can  be  related  directly  to  the  classical 
feedback  problem  as  formulated  by  Bode  ([5],  [6]).  This  problem  is  pictured 
graphically  for  the  SISO  case  in  Figure  3  using  the  well-known  Bode  gain  plot.  Note 
the  low  and  high  frequency  gain  conditions.  The  designer  must  find  a  loop  transfer 
function,  GK,  for  which  the  loop  is  nominally  stable  and  whose  gain  clears  the  high  and 
low  frequency  "desiqn  boundaries"  given  by  Condition  (17)  and  (19).  The  high  frequency 
boundary  is  mandatoary,  while  the  low  frequency  one  is  desirable  for  good  performance. 
Both  are  influenced  by  the  uncertainty  bound,  l.ra ( ui)  . 


A  representative  loop  transfer  function  is  also  sketched  in  the  figure.  As  shown,  the 
effective  bandwidth  of  the  loop  cannot  fall  much  beyond  the  frequency  u0  for  which 
tm(uj0)  3  !•  Note  also  that  phase  is  not  pictured  nor  mentioned  explicitly. 

What  is  needed  is  stability  and  well-behaved  crossover.  For  SISO  systems,  phase 
provides  a  very  convenient  way  to  evaluate  stability  and  crossover  properties  but  is 
not  important  in  and  of  itself.  Furthermore  the  phase  of  a  rational  function  is 
completely  determined  by  its  gain,  and  location  of  rhp  poles  and  zeros,  as  discussed  in 
the  companion  paper.  [19]  Thus,  for  evaluating  the  feedback  properties  of  a  SISO 
control  design,  phase  is  not  needed  explicitly,  given  that  stability  is  checked  and 
plots  of  11  +  gk  l  and  11  +  1/gk I  or  lgk(l  +  g k ) " 1 1  are  provided. 


.vs 


Figure  3.  The  Feedback  Design  Problem  According  to  Bode 


The  o-plot  generalization  of  the  Bode  gain  plot  and  its  interpretation  are  displayed 
graphically  in  Figure  4.  The  performance  conditions  are  essentially  the  same  as  for 
the  Bode  feedback  design  problem  with  singular  values  replacing  complex  magnitude.  As 
in  the  SISO  case  phase  is  not  needed.  Thus  the  fact  that  there  is  no  natural 
multivariabl e  generalization  of  phase  should  not  be  disturbing,  because  the  important 
feedback  properties  of  a  system  do  not  depend  explicitly  on  phase. 


Figure  4.  The  Multivariable  Feedback  Design  Problem 


The  o-plots  of  a  representative  loop  transfer  matrix  are  also  sketched  in  the 
figure.  As  in  the  SISO  case  the  effective  bandwidth  of  the  loop  cannot  fall  much 
beyond  the  frequency  u>0  for  which  ^m(wQ)  =  1.  As  a  result,  the  frequency 
range  over  which  performance  objectives  can  be  met  is  explicitly  constrained  by  the 
uncertainties.  It  is  also  evident  from  the  sketch  that  the  severity  of  this  constraint 
depends  on  the  rate  at  which  o[GK|  and  o[GK)  are  attenuated.  The  steeper  these 
functions  drop  off,  the  wider  the  frequency  range  over  which  Condition  (19)  can  be 
satisfied.  Unfortunately,  FDLTI  transfer  functions  behave  in  such  a  way  that  steep 
attenuation  comes  only  at  the  expense  of  small  o(I  +  GK)  values  and  small 
a(I  +  (GK)"1|  values  when  o[GK)  and  o(GK)  i  1.  This  means  that  while 
performance  is  good  at  lower  frequencies  and  stability  robustness  is  good  at  higher 
frequencies,  both  are  poor  near  crossover.  The  behavior  of  FDLTI  transfer  functions, 
therefore,  imposes  a  second  major  limitation  on  the  achievable  performance  of  feedback 
systems.  These  and  other  limitations  on  achievable  performance  will  be  considered  in 
the  companion  paper  (19). 
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VI.  Examples 

Two  examples  will  now  be  discussed  to  illustrate  the  ideas  presented  in  the  previous 
sections.  Both  examples  are  simple  and  of  low  order.  Hopefully,  this  will  encourage 
the  reader  to  experiment  with  the  examples  to  get  a  better  understanding  of  the  plots 
involved. 

The  first  example  is  a  two  input  oscillator  with  open  loop  poles  at  +  10 j  and  both 
closed  loop  poles  under  unity  feedback  at  -1.  The  loop  transfer  function  is 

6 (s)  =  -  Is  -  100  10<S  +  (21) 

S  +  100  |  —  1 0 ( s  +  1)  s  -  100  j 

and  with  unity  feedback,  K  =  I.  There  are  no  transmission  zeros. 

Suppose  we  begin  the  analysis  by  breaking  individual  loops  at  the  inputs  and  viewing 
the  multiloop  system  as  two  single  loop  systems.  With  either  loop  closed  (the  system 

is  symmetric)  the  transfer  function  for  the  other  loop  is 

g(s>  =| 

which  indicates  that  individually  the  loops  have  large  classical  SISO  margins  at 
crossover  (  +  <®  db  gain  and  90°  phase  at  a>  =  1)  . 

The  singular  values  of  GK ( I  +  GK)"1  are  plotted  in  Figure  5.  From  equation  (17),  we 

see  that  the  peak  in  a  near  20  radians,  indicates  that  there  exists  a  small 

perturbation  (j|m  =  .1)  that  would  create  an  instability.  Thus,  while  each  loop 
individually  may  have  large  stability  margins,  the  feedback  system  is  extremely 
sensitive  to  perturbations  which  effect  both  loops. 


log  frequency 

Figure  5.  o-Plot  for  example  1 

To  get  further  insight  into  this  example,  consider  a  diagonal  perturbation  (as  in  (13)) 


where  k^  and  k2  are  real  constants.  The  regions  of  stability  and  instability  may 
be  plotted  in  the  (kj,  k2>  plane  as  shown  in  Figure  6.  Gain-plane  plots  such  as 
these  have  been  used  by  engineers  for  decades,  but  they  unfortunately  do  not  easily 
generalize  to  higher  dimensions. 

In  Figure  6,  the  open  loop  point  is  kj  «  k2  *  -1  and  the  nominal  closed  loop  point 
is  kj  *  k2  «  0.  If  each  loop  is  broken  individually,  the  kj,  k2  axes  are 
checked  for  stability.  As  can  be  seen  from  Figure  6  this  would  miss  the  unstable 
region  near  the  origin  caused  by  simultaneous  changes  in  k^  and  k2.  Singular  value 
analysis  checks  in  all  directions  and  gives  a  more  reliable  characterization  of  the 
robustness  of  a  feedback  system. 


.MO 


Figure  6.  Example  1  Stability  Domain 

As  a  second  example  which  is  more  physically  motivated,  we  will  consider  a  longitudinal 
-axes  design  problem  for  the  CH-47  helicopter.  This  vehicle  is  a  tandem  rotor  machine 
whose  physical  characteristics  and  mathematical  models  are  given  in  [20)  and  [21]. 
Control  over  vertical  motions  is  achieved  by  simultaneous  changes  of  blade 
angle-of -attack  on  both  rotors  (collective) ,  while  pitch  and  forward  motions  are 
controlled  by  changing  blade  angle  differentially  between  the  two  rotors  (differential 
-collective) .  These  blade  angle  changes  are  transformed  through  rotor  dynamics  and 
aerodynamics  into  hub  forces  which  then  move  the  machine. 

Our  objectives  will  be  to  design  the  feedback  portion  of  a  command  augmentation  control 
law  which  achieves  tight,  non-interacting  control  of  the  vertical  velocity  and  pitch 
attitude  responses.  This  design  was  considered  previously  in  [4] .  A  small 
perturbation  linearized  aircraft  model  should  prove  adequate  for  this  purpose  and  is 
available  [201.  The  state  vector  consists  of  the  vehicle's  basic  rigid  body  variables 
x  »  (V,  z,  q,  0)  (forward  velocity,  vertical  velocity,  pitch  rate,  pitch  angle).  Two 
integrators  are  appended  to  provide  desired  low  frequency  gain.  These  would  also  be 
used  to  achieve  integral  control  of  the  primary  responses,  Z  and  0,  in  the  full 
command  augmentation  system.  The  controls  are  the  collective  and  differential- 
collective  inputs  described  above:  u  =  (c,  dc) . 

The  major  approximations  associated  with  this  linearized  model  are  due  to  both 
neglected  dynamics  of  the  rotors,  neglected  nonlinearities  in  the  blade  angle  actuation 
hardware,  as  well  as  changes  in  flight  condition.  We  will  focus  primarily  on  errors 
due  to  neglected  roto,  dynamics,  and  restrict  attention  to  a  single  flight  condition, 

40  knots  forward  airspeed.  This  allows  for  a  simple  illustration  of  the  ideas  of  the 
previous  sections,  without  sacrificing  the  engineering  relevance  of  this  example. 

Hence,  our  nominal  model  is 


x  =  Ax  +  Bu 

(A,  B  given  in  the  Appendix) 

(23) 

X5  =  z 

(24) 

x6  =  0 

(25) 

The  first  step  in  the  design  is  to  escimate  the  errors  due  to  neglected  rotor 
dynamics.  Elementary  dynamic  and  aerodynamic  analyses  of  rotating  airfoils,  hinged  at 
the  rotor  hub,  indicate  that  lift  forces  will  not  be  transmitted  to  the  hub 
instantaneously  with  collective  changes  in  blade  angle-of-attack  but  will  appear  only 
when  the  cone  angle  of  the  rotor  has  appropriately  changed.  The  dynamics  of  the  latter 
have  been  shown  to  be  damped  second-order  oscillations  with  natural  frequency  equal  to 
rotor  speed  and  damping  determined  by  somewhat  uncertain  aerodynamic  effects  1221. 
Hence,  rotor  dynamics  can  be  crudely  represented  by  second-order  transfer  functions 


(26) 
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with  u)R  =  25  rad/sec  and  conservatively  confined  to  the  range  0.1  <  <  1.0. 

Because  collective  and  differential-collective  inputs  both  involve  coning  motions  of 
the  rotors,  one  such  transfer  function  will  appear  in  each  control  channel. 

The  rotor  dynamics  can  be  represented  as  a  multiplicative  perturbation  (L  in  Eq.  (13) 
where 


This  function  was  evaluated  for  a  range  of  frequencies  and  is  shown  in  Figure  7.  From 
equations  (17)  and  (18)  we  know  that  this  uncertainty  level  will  limit  bandwidth  to 
less  than  10  rad/sec. 


An  alternative  approach  would  be  to  explicitly  add  the  rotor  dynamics  to  the  nominal 
model  and  represent  the  uncertain  damping  by  a  variable  parameter,  as  in  eq.  (26). 

This  would  eliminate  the  perturbation  due  to  the  first  mode  of  the  rotor  and  allow  for 
a  higher  bandwidth  controller.  Unfortunately,  such  a  high-bandwidth  controller  would 
require  modification  and  more  accurate  modeling  of  the  blade  angle  actuation  hardware, 
as  well  as  additional  modeling  of  higher-frequency  rotor  dynamics  (second  and  higher 
harmonics) .  An  engineer  considering  this  option  would  have  to  weigh  its  cost  against 
the  benefits  of  a  high  bandwidth  controller. 

In  any  case,  even  with  more  accurate  modeling  and  hardware  improvements,  the  feedback 
designer  would  still  be  faced  with  large  unstructured  uncertainty,  (possibly  at  higher 
frequencies,)  due  to  neglected  dynamics.  This  tradeoff,  where  more  costly  and  accurate 
modeling  allows  for  more  accurate  control,  is  a  central  issue  in  engineering  modeling 
problems.  By  using  the  model  of  eq.  (23)  -  (25),  we  have  opted  for  modeling  only  the 
basic  rigid  body  dynamics  explicitly  and  including  other  effects  in  the  unstructured 
uncertainty. 

By  combining  eq.  (27)  with  eqs.  (17)  -  (19)  we  obtain  the  bounds  on  the  singular  values 
of  the  loop  transfer  function  shown  in  Figure  8.  The  high  frequency  constraint  is 
obtained  from  the  neglected  rotor  dynamics  in  Figure  7.  The  constraint  has  been  made 
linear  for  convenience,  and  continues  downward  beyond  the  25  rad/sec  peak  because 
higher  order  harmonics  and  other  uncertainties  will  dominate  at  these  higher 
frequencies.  We  expect  that  second  order  attenuation  characteristics  beyond  25  rad/sec 
will  be  adequate  to  provide  stability  in  the  face  of  these  additional  uncertainties. 

The  low  frequency  constraints  should  provide  good  command  response  and  disturbance 
rejection  as  well  as  desensitizing  the  controller  to  changes  in  flight  conditions. 


Figure  7.  Uncertainty  Bound  for  Rotor  Figure  8.  o-plot  Constraints  for  CH-47  Design 


To  strengthen  the  connection  of  the  o-plots  with  classical  SISO  techniques  and  Bode 
plot  methods,  we  will  first  treat  a  single-loop  problem  of  controlling  pitch  attitude 
(0)  with  differential  collective  (dc) .  While  this  loop  decouples  from  the  vertical 


velocity/collective  loop  at  hover,  they  are  highly  coupled  in  the  40  knots  forward 
flight  condition  so  a  single  loop  design  would  not  be  recommended  here  except  for 
illustration  purposes.  The  Bode  gain  plots  of  three  alternative  designs  with  loops 
broken  at  the  input  are  shown  in  Figure  9.  These  controllers  are  all  state  feedback 
(plus  dynamic  compensation  in  case  lc)  since  for  the  CH-47  all  the  states  in  the 
nominal  model  are  measurable.  The  design  technique  used  is  described  briefly  in  the 
Appendix. 

Trial  la  controller  violates  the  low  frequency  condition  while  Trial  lb  violates  the 
high  frequency  condition.  By  adding  the  second  order  rotor  dynamics  from  (26)  we  can 
easily  verify  that  Trial  lb  is  unstable  for  =  .1.  Trial  lc  is  an  intermediate  design 
which  includes  a  low-pass  filter  at  u  =  12  rad/sec  to  help  avoid  the  rotor 
uncertainty.  Plots  of  II  +  gl  and  lg(l  +  gf^1  for  Trial  lc  are  shown  in 
Figure  10.  Crossover  is  well-behaved  since  11  +  gl  does  not  get  too  small  and 
lg  (1  +  g)"f|  does  not  get  too  large.  Recall  that  I g ( 1  +  g)"'f|  is  the 

magnitude  of  the  closed  loop  response  or  the  feedback  signal  to  the  input.  Thus  there 
would  be  little  peaking  of  the  response.  With  the  addition  of  Figure  10,  Figure  9  is 
unnecessary,  since  the  low  and  high  frequency  as  well  as  crossover  characteristics  can 
be  obtained  from  Figure  10  and  equations  (5)  and  (17). 


The  beauty  of  singular  value  analysis  is  that  the  above  analysis  for  SISO  systems 
carries  over  without  change  to  MIMO  systems.  This  is  illustrated  in  Figures  11  and  12 
with  two  trial  two-input  designs.  The  design  technique  used  is  described  briefly  in 
the  Appendix.  This  distinction  between  Figures  11  and  12  and  Figure  9  is  that  for  the 
two-input  designs,  two  plots  are  needed  for  each  trial.  Condition  (18)  implies  that 
o  (and  thus  all  cr)  must  lie  below  the  high  frequency  constraint  and  condition  (19) 
implies  o  (and  thus  all  o)  must  lie  above  the  low  frequency  constraint.  Trial  2a 
(Figure  11)  violates  both  conditions  while  Trial  2b  (Figure  12)  satisfies  both.  Both 
trials  use  state  feedback  but  Trial  2b  includes  a  first  order  lag  at  id  =  12  rad/sec 
to  provide  sufficient  attenuation  to  avoid  the  uncertainty  bounds.  The  n-plots  of  (I 
+  G)  and  G(I  +  G)"l  for  Trial  2b  are  shown  in  Figure  13.  These  can  be  interpreted  in 
terms  of  conditions  (5)  and  (17)  in  the  same  way  as  for  Trial  lc  in  Figure  10. 


Which  c-plots  an  engineer  chooses  to  use  is  somewhat  a  matter  of  taste  and  would 
depend  in  part  on  the  design  technique  being  used  and  on  the  stage  of  the  design. 
Although  we  did  not  explicitly  design  for  other  fliqht  conditions,  the  low  frequency 
gain  conditions  provide  considerable  robustness  with  respect  to  flight  condition 
variations.  Trial  2b  remains  stable  at  eight  representative  flight  conditions  from 
hover  to  160-knots  forward  speed  and  from  -2000  ft/min  to  +2000  ft/min  ascent  rates. 

It  must  be  emphasized,  however,  that  these  designs  were  for  illustration  purposes  only 
and  are  not  intended  to  be  flight  quality  designs.  Trial  2b  might  be  considered  as  a 
candidate  control  law  but  would  require  implementation  and  verification  through 
high-fidelity  simulation  and  flight  test  before  qualifying  as  a  flight  quality  design. 
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Figure  11.  o-Plots  of  G  for  Trial  2a 


Figure  12.  o-Plots  of  G  for  Trial  2b 


VII.  Conclusions 

This  paper  has  attempted  to  present  a  practical  design  perspective  on  multiloop  linear 
time  invariant  feedback  control  problems.  It  has  focused  on  the  fundamental  issue  -- 
feedback  in  the  face  of  uncertainties  --  and  has  shown  how  classical  SISO  approaches  to 
this  issue  can  be  reliably  and  meaningfully  generalized  to  MIMO  systems. 

There  were  many  other  topics  not  considered  in  this  paper  which  are  less  fundamental  to 
the  feedback  problem  but  still  very  important.  For  example,  we  did  not  treat  certain 
performance  objectives  in  MIMO  systems  which  are  distinct  from  SISO  systems.  These 
include  perfect  non-interaction  and  integrity.  Non-interaction  is  again  a  structural 
property  which  loses  most  of  its  meaning  in  the  face  of  unstructured  uncertainties. 

(It  is  achieved  as  well  as  possible  by  Condition  19).  Integrity,  on  the  other  hand, 
cannot  be  dismissed  as  lightly.  It  concerns  the  ability  of  MIMO  systems  to  maintain 
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stability 
described 
integr i ty 


in  the  face  of  actuator  and/or  sensor  failures, 
here  are  indeed  useful  for  integrity  analysis, 
with  respect  to  actuator  failures  whenever 


The  singular  value  concepts 
For  example,  a  design  has 


o(I  +  (KG)  -1)  >  1  V  u> 


This  follows  because  failures  satisfy  Jm  _<  1. 

The  major  limitations  on  what  has  been  said  in  the  paper  are  associated  with  the 
representation  chosen  in  Section  4  for  unstructured  uncertainty.  A  single  magnitude 
bound  on  matrix  perturbations  is  a  worst-case  representation  which  is  often  much  too 
conservative  (i.e.,  it  may  admit  perturbations  which  are  structurally  known  not  to 
occur).  The  use  of  weighted  norms  in  (8)  -  (9)  or  selective  transformations  applied  to 
G  (as  in  (23))  can  alleviate  this  conservatism  somewhat,  but  seldom  completely. 

A  related  drawback  is  the  implicit  assumption  that  all  loops  (all  directions)  of  the 
MIMO  system  should  have  equal  bandwidth  (a  close  to  o  in  Figure  4) .  This 
assumption  is  consistent  with  a  uniform  uncertainty  bound  but  is  not  appropriate  for 
more  complex  uncertainty  structures.  These  issues  are  important  and  complex  enough  to 
deserve  more  attention  in  their  own  right  than  can  be  given  here.  Research  along  these 
lines  is  proceeding. 


This  paper  has  not  addressed  the  problem  of  actually  synthesizing  controllers  to 
satisfy  the  design  conditions  developed.  The  approach  taken  here  has  been  to  examine 
fundamental  feedback  issues  and  provide  reliable  and  effective  tools  for  analyzing  the 
feedback  properties  of  multiloop  systems.  These  issues  are  independent  of  the  specific 
synthesis  tools  used  and  any  technique  which  can  provide  a  controller  satisfying  the 
constraints  for  a  particular  problem  will  be  adequate.  Unfortunately,  most  existing 
synthesis  techniques  are  oriented  at  manipulating  quantities  which  are  at  best  only 
indirectly  related  to  feedback  properties.  An  exception  to  this  is  the  technique 
described  in  [1]  and  in  more  detail  in  two  papers  by  Stein  ((24),  [25])  in  this  lecture 
series.  The  reader  is  encouraged  to  consider  the  techniques  described  in  these  papers 
by  Stein  in  the  context  of  the  feedback  design  problem  as  outlined  in  the  preceeding 
sections  of  this  paper. 

The  companion  paper  [19]  addresses  the  achievable  performance  of  a  feedback  system  in 
the  face  of  uncertainty.  The  sources  of  limitations  on  achievable  performance 
considered  are: 


1)  The  algebraic  conflict  between  providing  desensitization  and  insuring 
stability  in  the  presence  of  large  uncertainty. 

2)  The  functional  relationship  between  gain  and  phase  for  causal,  rational 
transfer  functions. 

3)  Right  hand  plane  transmission  zeros 

4)  Directionality  conflicts 

The  last  issue  is  a  MIMO  problem  that  has  no  SISO  analogue  while  the  first  three  are 
MIMO  generalization  of  classical  SISO  properties. 
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APPENDIX 


The  nominal  model  for  the  dynamics  of  the  CH-47  at  40  knots  forward  airspeed  has  (21) 
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for  A  and  B  in  equation  (23). 

The  example  designs  were  all  state  feedback  including  integrators  (and  lag  filters  in 
Trials  lc  and  2b).  The  state  feedback  gains  were  obtained  using  linear-quadratic 
methodology,  selected  primarily  for  its  convenience  and  good  frequency-domain 
properties  (see  (1),  (24),  (25)).  In  the  context  of  feedback  design,  time-domain 
optimality,  per  se,  is  of  no  interest.  The  state  feedback  gains  for  Trail  2b  (states  5 
and  6  are  integrators)  are 

K  =  I-4.72X10'4  .0259  12.6  47.9  .0417  -57.2  | 

1-0163  -.  535  .  231  2.  16  .  999  2 . 39  J 

Trial  2b  also  has  first  order  lags 

1 


1  +  s/12 

added  to  each  channel  to  provide  additional  high-frequency  attenuation. 
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Summary 

This  paper  examines  the  fundamental  limitations  on  the  achievable  performance  of 
multivariable  feedback  systems.  Using  the  singular  value  analysis  techniques  discussed 
in  the  companion  paper  (1|,  the  design  tradeoffs  that  are  the  focus  of  classical 
single-loop  theory  are  generalized  to  multiloop  systems.  The  first  tradeoff  is 
algebraic  between  large  Poop  gain  for  good  sensitivity  properties  and  small  loop  gain 
for  stability  margins.  Another  is  the  functional  tradeoff  imposed  by  the  Bode 
gain/phase  relations.  These  tradeoffs  impose  limitations  on  the  achievable  performance 
of  any  feedback  design.  The  limitations  caused  by  nonminimum  phase  zeros  is  also 
discussed.  Finally,  the  problem  of  directionality  in  multiloop  systems  is  presented 
and  analyzed.  This  is  a  uniquely  multiloop  problem  and  has  no  analog  in  single  loop 
feedback  systems. 


1.  Introduction 

Over  the  last  fifteen  years,  a  misconception  has  arisen  which  implies  that  the 
application  of  advanced  control  concepts  can  control  any  system  to  any  level  of  desired 
performance.  This  misconception  can  be  traced  in  part  to  the  aura  that  originally 
surrounded  optimal  control  and  estimation  theory.  The  notions  that  a  design  could  be 
optimal,  poles  could  be  placed  arbitrarily,  and  that  variables  that  could  not  be 
measured  could  be  estimated  contributed  heavily  to  the  misconception.  In  reacting  to 
the  promises  of  optimal  control  and  estimation,  many  designers  lost  sight  of  the  true 
nature  of  the  control  problem.  Control  systems  are  designed  to  satisfy  performance 
specifications.  Performance  is  limited  by  the  nature  of  the  control  application  and 
the  actuation  and  measurement  devices  available.  Feedback  control  systems  possess 
severe  performance  limitations  because  of  the  potential  for  destabilization  and  its 
consequences. 

This  paper  examines  limitations  on  the  achievable  performance  of  multivariable  feedback 
systems.  The  notion  of  performance  that  is  used  in  this  paper  is  the  desensitizing 
effect  that  feedback  can  have  on  uncertainty  in  the  plant. 

Section  2  reviews  the  fundamental  properties  of  feedback  systems  involving  uncertainty 
using  the  techniques  presented  in  (1).  From  this  basis  is  developed  the  algebraic 
conflict  between  high  desensitization  and  large  stability  margins.  In  Section  3,  the 
Bode  gain/phase  relations  are  generalized  to  multivariable  systems.  These  relations 
impose  a  functional  tradeoff  between  the  rate  of  attenuation  of  the  loop  gain  and  the 
stability  margins  at  crossover.  This  result  is  used  in  Section  4  to  study  the  impact 
on  performance  of  transmission  zeros  in  the  right  half  plane. 

In  Section  5,  the  issue  of  directionality  in  multivariable  feedback  systems  is 
explored.  It  is  shown  that  under  some  circumstances  there  may  be  a  conflict  between 
margins  at  the  input  and  at  the  output  of  the  plant.  Section  6  has  the  conclusions. 

2.  Algebraic  Constraints  on  Feedback  Systems 

Consider  the  feedback  configuration  shown  in  Figure  1.  This  represents  a  (nominally) 
stable  closed-loop  system  consisting  of  a  linear  t ime- invar iant  plant  G(s!  and 
controller  K(s).  In  this  section,  we  will  discuss  multivariable  generalizations  of 
some  classical  properties  of  single-loop  feedback  systems. 
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Figure  1. 
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One  use  of  feedback  is  to  reduce  sensitivity  to  changes  in  the  plant.  It  is  well-known 
that  a  useful  sensitivity  function  for  a  single-loop  feedback  system  is  the  inverse  of 
the  return  difference.  To  quantify  the  desensitizing  effect  of  feedback  in  the 
multivariable  case,  we  use  the  comparison  sensitivity  approach  (4).  For  the 
comparison,  let  G  represent  the  nominal  plant  and  G(I+A)"1  represent  the  perturbed 
plant.  To  obtain  a  direct  comparison  of  sensitivity  with  and  without  feedback,  we 
consider  the  feedback  system  with  precompensation  P  =  (I+KG).  This  precompensation  is 
added  so  that  the  closed-loop  transfer  function  for  the  nominal  plant  is  identical  with 
the  nominal  open-loop  transfer  function,  G.  The  perturbed  closed-loop  transfer 
function  is 


G(I  +  A) -J-(I+KG(I  +  A)  ) -1(I+KG)  =  G  ( 1+  ( I  +KG)  "  !a>  *  1 

G(I+  Acl) 

where  the  effective  closed-loop  perturbations  A^l  is 
Acl  =  ( I+KG) A 


2.  1 


2.2 


Thus  for  multivariable  systems,  the  inverse  return  difference  matrix,  (I+KG)~1, 
relates  the  sensitivity  of  the  closed-loop  system  to  that  of  the  open-loop  system.  In 
terms  of  the  spectral  norm,  if  the  condition 


1  >  o[ (I+KG)"1| 
or,  equivalently 


o(I+KG) >1 


2.3 


2.4 


holds  at  some  frequency,  then  the  use  of  feedback  accomplishes  desensitization  at  that 
frequency. 

Feedback  can  also  reduce  the  effect  of  a  disturbance,  d,  on  the  output,  y  (Figure  1) . 
The  response  at  the  output  to  this  disturbance  is 


y  =  (I+KG)-1d 
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Since  the  open-loop  response  to  such  a  disturbance  is  simply  d,  satisfaction  of 
condition  (2.4)  at  some  frequencies  means  that  feedback  provides  some  disturbance 
rejection  at  those  frequencies. 

In  terms  of  sensitivity  and  disturbance  rejection,  good  performance  of  a 
multivariable  (as  well  as  single-loop)  feedback  system  translates  into  the  requirement 
of  a  "large"  return  difference  matrix.  Since 


lo(I+KG)  -  o (KG) I <1 
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this  is  equivalent  to  requiring  "large"  loop  gain.  A  more  detailed  discussion  of  the 
relationship  between  performance  and  loop  gain  may  be  found  in  the  companion  paper 
[1].  We  will  return  to  the  desideratum  of  large  loop  gain  after  considering  stability 
margins  for  multivariable  feedback  systems. 

The  representation  of  uncertainty  as  (I  +  A)'l  is  convenient  for  studying 
sensitivity.  Some  uncertainties  in  physical  systems,  however,  cannot  be  represented  in 
this  way.  In  particular,  both  gain  and  phase  become  highly  uncertain  at  high 
frequencies.  Modeling  the  plant  with  uncertainty  as  G(I+A)  is  quite  useful  in  these 
situations  ( 1 1 . 

Using  this  representation  for  the  plant  with  uncertainty  in  the  multivariable 
feedback  system  of  Figute  1,  the  closed-loop  system  remains  stable  for  all  (stable) 
perturbations  A(s)  satisfying 


o(A(jw))<  a  [  I  +  (K  ( jw)  G  ( jui)  )  ' 1 )  V 
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The  size  of  I+(KG)~1  measures  the  robustness  of  closed-loop  stability  with  respect  to 
multiplicative  perturbations  (I+A).  This  is  directly  analogous  to  the  distance  to 
the  critical  point  on  an  inverse  Nyquist  diagram,  ll+l/KGl. 

Adequate  representations  of  physical  systems  inevitably  require  that  the  perturbation 
become  large  at  high  frequency,  and  so  large  (KG)-^  (or,  equivalently,  small  loop 
gain  KG)  is  required  at  these  frequencies  to  ensure  stability. 

This  last  statement  leads  to  a  trade-off  in  the  design  of  feedback  systems.  At  a 
particular  frequency,  it  is  possible  to  have  large  loop  gain  for  desensitization  and 
disturbance  rejection  or  small  loop  gain  for  stability  margins  --  but  not  both.  More 
precisely,  consider  the  identity 


(I+KG)-1  +  ( ( I + ( KG) ~ 1 ) _ 1 )  =  I 


A  direct  consequence  of  this  identity  is  the  strinq  of  inequalities 
a(I+L) _  „  „  <-  fl(I+L) 


o(I+L)  +1 


o  ( I+L) - 1 


2.8 


2.9 
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where  either  KG  or  (KG)~1  may  be  substituted  for  L,  and  the  right-hand  inequality 
requires  o(I+L)>  1.  Clearly  then, 

o  ( I +L)  >>1  >  o(I  +  L_1)  5  1 

Thus,  desensitization  can  be  obtained  only  at  frequencies  where  stability  margins  need 
not  be  large.  Since  large  stability  margins  are  always  required  at  sufficiently  high 
frequencies,  no  desensitization  can  be  achieved  there. 


As  an  aside,  note  that  the  left-hand  inequality  in  2.9  may  be  rewritten  as 


for  od  +  L'1)  <1. 
o ( I+L-1) <<  1 


o( I+L) 


a( I  +  L  1 ) 
l-0( I+L-1 ) 


Hence 


o ( I+L) << 


1 


2.10 


Recall  that  L  may  be  replaced  by  either  KG  or  (KG)'l.  So  roughly  speaking,  "poor" 
sensitivity  properties  at  any  frequency  is  equivalent  to  “poor"  margins  there. 


3.  Functional  Limitations  on  Transfer  Functions 


We  have  seen  that  the  trade-off  between  desensitization  and  stability  margins  applies 
to  multivariable  feedback  systems  as  well  as  to  single-loop  systems.  Typically, 
desensitization  and  disturbance  rejection  are  desired  at  low  frequencies,  and  large 
stability  margins  are  required  at  high  frequencies.  A  critical  factor  in  the  design 
process  is  the  necessity  to  make  the  transition  from  large  loop  gain  to  small  loop 
gain.  Throughout  this  transition  region,  poor  sensitivity  and  poor  stability  margins 
must  be  avoided. 


The  difficulty  in  this  transition  depends  on  the  required  attenuation  rate  of  the 
loop  gain.  The  steeper  the  loop  gain  drops  off,  the  wider  the  frequency  range  over 
which  it  may  be  large.  Unfortunately,  however,  FDLTI  transfer  functions  behave  in  such 
a  way  that  steep  attenuation  comes  only  at  the  expense  of  small  o[I+GK]  values  and 
small  o(I  +  ( GK ) ~ 1 )  values  when  o[GK]  and  o(GK]  i  1.  This  means  that  while 
performance  is  good  at  lower  frequencies  and  stability  robustness  is  good  at  higher 
frequencies  both  are  poor  near  crossover.  The  behavior  of  FDLTI  transfer  functions, 
therefore,  imposes  a  second  major  limitation  on  the  achievable  performance  of  feedback 
systems. 

SISO  Transfer  Function  Limitation 


For  SISO  cases,  the  conflict  between  attenuation  rates  and  loop  quality  at  crossover  is 
again  well  understood.  We  know  that  any  rational,  stable,  proper,  minimum  phase  loop 
transfer  function  satisfies  fixed  integral  relations  between  its  gain  and  phase 
components.  Hence,  its  phase  angle  near  crossover  (i.e.,  at  values  of  w  such  that 
igk(jio)1  1)  is  determined  uniquely  by  the  gain.  Various  expressions  for  this 
angle  were  derived  by  Bode  using  contour  integration  around  closed  contours 
encompassing  the  right  half  plane.  [6,  Chapters  13,  14)  One  expression  is 
A 

i’gkc  =  arq(gk(3wc)  ( 


in  l  gk  ( jw  ( v)  )  l -!.n  l  qk  (  jwc)  i  dv  3.1 

sinh  v 

where  v  =  tnlw/w,-’  ,  w(v,  =  mc  exp  v.  Since  the  sign  of  sinh  (v) 
is  the  same  as  tne  sign  of  v,  it  follows  that  4gkc  wiH  he  large  if  the  gain 
Igk  I  attenuates  slowly  and  small  if  it  attenuates  rapidly.  In  fact,  <i>gkc  is 
given  explicitly  in  terms  of  weighted  average  attenuation  rate  by  the  following 
alternate  form  of  (3.1) (also  from  |6J: 

*g  k  c  *  -  /'  d  in  l  g  k  i  ^g.n  coth  dv  3.2 


The  behavior  of  ‘fcgkc  is  significant  because  it  defines  the  magnitudes  of  our  two 
SISO  design  conditions  2.2  and  2.7  at  crossover.  (Equations  17  and  19  in  (1]). 
Specifically  when  igki  =  1,  we  have 


1 1  +  gk  I 


ll+lgk)*1! 


2'»" 
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The  quantity  of  m  +  1>g|<c  is  the  phase  margin  of  the  feedback  system.  Assuming  qk 
stable,  this  margin  must  be  positive  for  nominal  stability  and,  according  to  3.3  it 
must  be  reasonably  large  (51  rad)  for  good  return  difference  and  stability  robustness 
properties.  If  1’+4gkc  is  forced  to  be  very  small  by  rapid  gain  a  t  ten  t  ua  t  ion ,  the 
feedback  system  will  amplify  disturbances  ( *  1 +g  k 1 <  < 1 )  and  exhibit  little 
uncertainty  tolerance  at  and  near  mc .  The  conflict  between  attenuation  rate  and 
loop  quality  near  cross-over  is  thus  clearly  evident. 
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It  is  also  known  that  more  general  nonminimum  phase  and/or  unstable  loop  transfer 
functions  do  not  alleviate  this  conflict.  If  the  plant  has  right  half  plane  zeros,  for 
example,  it  may  be  factored  as 

g (s)  =  m(s)  p(s)  3.4 


where  m(s)  is  minimum  phase  and  p(s)  is  an  all-pass  (i.e.,  Ip(jui)  l  =  1  V  ui.  ) 
The  (negative)  phase  angle  of  p(s)  reduces  total  phase  at  crossover,  i.e.. 


$gkc  '  ♦  mkc  +  ^pc  1  ‘•’mkc 

and  therefore  aggravates  the  tradeoff  problem.  In  fact,  if  !*pCl  is  too  large, 
we  will  be  forced  to  reduce  the  crossover  frequency.  Thus  rhp  zeros  limit  loop  gain 
(and  thus  performance)  in  a  way  similar  to  the  unstructured  uncertainty.  A  measure  of 
severity  of  this  added  limitation  is  II  -  p(ju)l,  which  can  be  U3ed  just  like 
( o>)  to  constrain  a  nominal  minimum  phase  design.. 

If  g (s)  has  rhp  poles,  the  extra  phase  lead  contributed  by  these  poles  compared  with 
their  mirror  images  in  the  left  half-plane  is  needed  to  provide  encirclements  for 
stability.  Unstable  plants  thus  also  do  not  offer  any  inherent  advantage  over  stable 
plants  in  alleviating  the  cross-over  conflict. 

Multivariable  Generalization 


The  above  transfer  function  limitations  for  SISO  systems  have  multivariable 
generalizations,  with  some  complications  as  would  be  expected. 

The  major  complication  is  that  singular  values  of  rational  transfer  matrices,  viewed  as 
functions  of  the  complex  variable  s,  are  not  analytic  and  therefore  cannot  be  used  for 
contour  integration  to  derive  relations  such  as  3.1.  Eigenvalues  of  rational  matrices, 
on  the  other  hand,  have  the  necessary  mathematical  properties.  Unfortunately,  they  do 
not  in  general  relate  directly  to  the  quality  of  the  feedback  design.  Thus,  we  must 
combine  the  properties  of  eigenvalues  and  singular  values  through  the  bounding  relations 

a  [  A]  <  I  A  [All  <  a  [A]  3.6 

which  holds  for  any  eigenvalue,  Aj,  of  the  (square)  matrix  A.  The  approach  will  be 
to  derive  gain/phase  relations  as  in  equation  3.1  for  the  eigenvalues  of  I  +  GK  and  I  + 

( GK ) ~ 1  and  to  u=“  these  to  bound  their  minimum  singular  values.  Since  good 
performance  and  stability  robustness  requires  singular  values  of  both  of  these  matrices 
to  be  sufficiently  large  near  crossover,  the  multivariable  system's  properties  can  then 
be  no  better  than  the  properties  of  their  eigenvalue  bounds. 

Equations  for  the  eigenvalues  themselves  are  straighforward.  There  is  a  one-to-one 
correspondence  between  eigenvalues  of  GK  and  eigenvalues  of  I  +  GK  such  that 

1  i  [ I +GK )  *  1  +  1 j  ( GK )  3.7 

Likewise  for  I  +  (GK)'l; 

X  i  [  1  +  (GK)  _1)  =  1  +  _ J _  3.8 

>i[GK] 

Thus  when  I A  ^  [  GK )  I  =  1  for  some  Aj  and  ui  =  u>c,  we  have 

"ti)j  c 

I  A  i [  I +GK  )  I  =  I  X  £  (  H-  (GK )  ” 1 1  l  =  2 1  s i n  — _i -  3.9 

Since  this  equation  is  exactly  analogous  to  equation  3.3  for  the  scalar  case,  and  since 
t  X  £ l  bounds  a,  it  follows  that  the  loop  will  exhibit  poor  properties  whenever 
the  phase  angle  C  +  tx  c)  is  small. 

In  order  to  derive  expressions  for  the  angle  <t>  \  c  itself,  we  require  certain 
results  from  the  theory  of  algebraic  functions.  [9-15]  The  key  concepts  needed  from 
these  references  are  that  the  eigenvalues  Xj  of  a  rational,  proper  transfer 
function  matrix,  viewed  as  a  function  of  the  complex  variable,  s,  constitute  one 
mathematical  entity,  X,  called  an  algebraic  function.  Each  eigenvalue,  Xj,  is  a 
branch  of  this  function  and  is  defined  on  one  sheet  of  an  extended  Riemann  surface 
domain.  On  its  extended  domain  an  algebraic  function  can  be  treated  as  an  ordinary 
meromorphic  function  whose  poles  and  zeros  are  the  system  poles  and  transmission  zeros 
of  the  transfer  function  matrix.  It  also  has  additional  critical  points,  called  branch 
points,  which  correspond  to  multiple  eigenvalues.  Contour  integration  is  valid  on  the 
Riemann  surface  domain  provided  that  contours  are  properly  closed. 


In  the  contour  integral  leading  to  3.1,  gk(s)  may  therefore  be  replaced  by  the 
algebraic  function,  X(s),  with  contour  taken  on  its  Reimann  domain.  Carrying  out 
this  integral  yields  several  partial  sums: 


✓ 

i 


V 


1 


e n |  X  -  (jr.i(v) )  |  -  ln|Aj(j<u  ) 
sinh~\> 


d  \t 


/ 

i 


3.10 
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where  each  sum  is  over  all  branches  of  A(s)  whose  sheets  are  connected  by  right 

half-plane  branch  points.  Thus  the  eigenvalues  (Aj)  are  restricted  in  a  way 

similar  to  scaler  transfer  functions  but  in  summation  form.  The  summation,  however, 
does  not  alter  the  fundamental  tradeoff  between  attenuation  rate  and  loop  quality  at 
crossover.  In  fact,  if  we  deliberately  choose  to  maximize  the  bound  3.9  by  making 
ojc  and  4\ic  identical  for  all  i,  then  3.10  imposes  the  same  restrictions  on 
multivariable  loops  as  3.1  imposes  on  SISO  loops.  Hence,  multivariable  systems  do  not 
escape  the  fundamental  transfer  function  limitations. 

As  in  the  scalar  case,  expression  3.10  is  again  valid  for  minimum  phase  systems  only. 
That  is  GK  can  have  no  transmission  zeros  in  the  rhp.  (For  our  purposes,  transmission 

zeros  (17)  are  values  such  that  det  (G(s)K(s)]  =  0.  Degenerate  systems  with  det  (GK)  = 

0  for  all  s  are  not  of  interest  because  they  cannot  meet  the  condition  in  equation 
(2.4)  at  any  frequency.)  If  this  is  not  true,  the  tradeoffs  governed  by  3.9-3.10  are 
aggravated  because  every  rhp  transmission  zero  adds  the  same  phase  lag  as  in  3.5  to 
one  of  the  partial  sums  in  3.10.  The  matrix  GK  may  also  be  factored,  as  in  3.4  to  get 

GK(s)  =  M ( s)  P(s)  3.11 

where  M(s)  has  no  rhp  zeros  and  P(s)  is  an  all-pass  matrix  PT(-s)P(s)  =  I.  Analogous 
to  the  scalar  case,  <j(I-P(s))  can  be  taken  as  a  measure  of  the  degree  of 
multivariable  nonminimum  phaseness  and  used  like  £m(u>)  to  constrain  a  nominal 
minimum  phase  design.  Nonminimum  phase  systems  are  considered  in  more  detail  in  the 
next  section. 

4.  Non-Minimum  Phase  Behavior  in  Multivariable  Systems 

Given  a  desired  rational,  strictly  proper  loop  transfer  function  H(s)  that  yields  a 
stable  closed-loop  system,  there  exists  rational,  strictly  proper  compensation  K(s)  for 
the  minimum  phase  plant  G(s)  such  that  the  desired  loop  transfer  function  is  fit 
arbitrarily  closely  and  the  resulting  closed-loop  system  is  stable.  Indeed,  it 
suffices  to  introduce  the  feedback  compensation 

s  +  (1-  I  4.1 


where  m  is  chosen  to  make  K(s)  strictly  proper  and  a  is  chosen  sufficiently  large. 

This  means  that  if  H(s)  is  designed  to  achieve  the  sensitivity  and  stability  margin 
tradeoff  discussed  in  Section  2.0,  then  any  minimum  phase  plant  can  be  compensated  to 
realize  this  loop  transfer  function  as  closely  as  desired.  The  same  result  applies  to 
multivariable  systems  as  well  as  single-loop  systems  by  interpreting  (4.1)  in  a  matrix 
sense.  In  the  multivariable  case,  however,  such  compensation  yields  the  desired  loop 
properties  only  at  the  plant  inputs;  the  output  loop  properties  can  be  quite  different. 


This  is  not  true  for  non-minimum  phase  systems, 
r ight-half-plane  zeros  may  be  factored  as 

G  (  s)  =  M(s)P(s) 


A  transfer  function  with 


where  M(s)  is  minimum  phase  and  P(s)  is  an  all-pass  (iP(Jw)l  =  1,  Vw)  .  The 
all-pass  cannot  be  cancelled  as  in  (4.1).  It  contributes  a  negative  phase  angle  and 
thereby  aggravates  the  tradeoff  problem.  Roughly  speaking,  a  non-minimum  phase  zero 
A  requires  small  loop  gain  throughout  some  freqency  interval  "near”  A (7).  That  is, 
the  advantages  of  feedback  are  achievable  at  frequencies  below  and  above  the 
non-minimum  phase  zero's  magnitude  but  not  around  it. 

Multivariable  non-minimum  phase  systems  are  subject  to  analogous  restrictions.  A 
plant  transfer  matrix  with  right-half-plane  zeros  may  be  factored  in  two  ways  as 

G  ( s)  =  Mi  (s)  Pits)  =  P2(s)M2(s)  4.3 

where  the  P's  are  all-pass  ( P*  ( Jui)  P(  ju)  «  I  Vw)  and  the  M's  are  minimum  phase. 

These  factorizations  can  be  expressed  in  state-space  terms.  Let  G(s)  be  the  transfer 
matrix  of  the  quadruple  (A,  B,  C,  D) .  Then  for  our  purposes,  G(s)  has  a  zero  at  A 


has  a  solution  with  w*w  =  1  (17)  for  a  single  real  zero  at  A.  One  factorization  (unique 
up  to  a  unitary  transformation)  Mi(s)Pi(s)  is  given  by 


Pi  ( s)  =  I 


and  Mj(s)  is  the  transfer  matrix  of  the  quadruple  (A,  B,  C,  D)  where 
B  =  B  -  2Axwt 
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For  the  case  of  a  single  pair  of  complex  zeros  at  X  and  X 

P,(s)  -  I  -  (—5 — - )(sRe(ww*-  ^  ww*)  +  Re ( Xww*-afiww* ) |  4  7 

1  s‘:+2as+XX  K 

B  =  B-2ciy  Ixw*-^-  xw*)|  4-8 

2fl2 

where  a  =  Re  (X)  ,  8  =  wTw,  and  y  =  (1  -  \'Y*  )  •  T^e  proofs  of  (4.5)  -  (4.8) 

follow  from  the  generalized  eigenvalue  problem  (4.4)  with  algebraic  manipulation.  Multiple 

non-minimum  phase  zeros  can  be  handled  by  repeated  application  of  (4.5)  -  (4.8).  We  note 

that  similar  formulas  may  be  obtained  for  the  factorization  P2<s)M2(s)  by  using  the 

left  generalized  eigenvector. 


The  non-minimum  phase  zero  X  adds  extra  phase  lag  in  the  input  direction  w  and  the 
output  direction  v.  This  is  the  same  effect  and  has  the  same  restrictions  for  feedback 
design  as  in  the  single-loop  case.  However,  the  transfer  matrix  from  inputs  orthogonal  to 
w  to  outputs  orthogonal  to  v  does  not  have  any  additional  phase  lag  caused  by  X.  Thus, 
the  non-minimum  zero  X  does  not  impose  any  limitations  on  the  loop  gain  in  these 
orthogonal  directions.  (Other  considerations  may  restrict  this  loop  gain.  For  example, 
the  transfer  matrix  associated  with  this  subspace  may  itself  be  non-minimum  phase,  although 
not  having  a  zero  at  X.) 

Because  a  non-minimum  phase  zero  X  does  not  impose  limitations  in  directions  orthogonal 
to  w  and  v,  it  could  be  the  case  that  a  feedback  design  has  large  loop  gain  in  these 
directions.  The  next  section  discusses  problems  associated  with  a  spread  in  the  singular 
values  of  KG.  Such  a  design  would  necessarily  have  a  large  spread  in  the  singular  values 
of  KG  at  frequencies  near  X. 

5.  Directionality  in  Multivariable  Feedback  Systems 

The  preceeding  sections  discussed  frequency-domain  tradeoffs  which  occur  in  the 
design  of  both  single-loop  and  multivariable  feedback  systems.  In  this  section,  we 
examine  some  tradeoffs  that  are  unique  to  multivariable  systems. 

Multivariable  system,  unlike  scalar  system,  signals  have  a  "spatial"  as  well  as 
frequency  distribution.  Some  sensors  are  noisier  than  others.  Certain  actuators  might 
saturate  at  lower  signal  levels  than  others.  Disturbances  enter  particular  loops,  but 
not  others,  etc.  One  is  lead  to  consider,  for  example,  different  "bandwidths"  in 
various  loops. 

The  stability  condition  (2.7)  is  tight  if  all  that  is  known  about  the  perturbation 
A  is  its  maximum  singular  value.  However,  if  more  is  known  about  A  -  large  in  one 
direction,  small  in  another  --  then  condition  (2.7)  can  be  sharpened  through  the  use  of 
frequency-dependent  weighting  matrices  (8(.  That  is,  for  a  perturbed  plant 
G ( s)  ( I+A ( s) ) ,  the  closed-loop  system  remains  stable  for  all  (stable)  perturbations 
A(s)  satisfying 

0  (Ql  (  joo)  A  (  jw)  Rjl  (  j(D)  I  <0  IRj  (  ju>)  (I+K  ( jw)  G(  jdl)  _1)  Q{ 1  (  j<*>)  1  V  w  >  0  5.1 

The  known  characteristics  of  the  the  allowable  perturbations  are  reflected  in  the 
(nonsingular)  weighting  matrices  Rj  and  Qj. 

The  stability  margin  condition  (5.1)  yields  margins  at  a  particular  point  in  the 
feedback  configuration  of  Figure  1.  These  margins  are  at  the  inputs  to  the  plant.  The 
weighting  matrices  in  (5.1)  are  useful  in  handling  the  issue  of  directionality  at  this 
point.  For  single-loop  systems,  margins  at  the  input  are  identically  equivalent  to 
margins  at  the  output.  Input  and  output  margins  are  not  equivalent,  however,  in  the 
muyltivariable  case.  Output  margins  are  obtained  by  considering  a  perturbed  plant 
( 1+ A0(s) ) G ( s) .  The  stability  condition  (using  weighting  matrices  R0  and  0°) 
may  be  written  as 

°(Qo< Jw) *o<jw> Ro1 < jw) 1 0 lRop< lw>  ( I+ (G( jw) K ( jw) ) _1 ( jw) |  Mw>o  5.2 

This  is  the  same  condition  as  (5.1),  with  G  and  K  interchanged. 

From  a  design  viewpoint,  good  margins  may  be  desired  at  both  the  inputs  and  the  outputs 
of  the  plant.  One  may  express  this  objective  slightly  differently  by  first  fixing  the 
input  margins  and  then  examining  the  limits  imposed  on  the  output  margins.  More 
precisely,  we  pose  the  following  formal  optimization  problem  at  a  single  frequency: 

max  o(R0(I  +  (GK)  'MoS1! 

K 

r  -1  -1  (P) 

s.t.  0.  [RT  (I+(KG)  )Qj  ]  =  1  Vi 
and  now  we  require  both  G  and  K  to  be  square. 


This  constraint  of  all  singular  values  equal  to  1  is  equivalent  to  constraining  the 
matrix  to  be  unitary.  Hence 
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I+(KG)-1  =  Rf^UQj  5.3 

for  some  unitary  U.  The  function  to  be  maximized  can  then  be  written  as 
0|R0(I  +  (GK)-l)Q3l)  =  O  (  R0G  ( 1+ (KG)  )  G^-Qo1  J  5.4 

=  £[R0GRI"1)  U(QqGQi_1) ] 

This  means  the  original  problem  (P)  is  equivalent  to 
max  o[R0GRil)U(Q0GQji) 'll 

U  (P1) 

s.t.  UU*  =  X 

To  solve  (P1),  let  RoGRJ1  =  U1IiV1*and  QoGQi1=02J:2v2* 
be  singular  value  decompositions  where 

=  diag|a1(oi2. anl  <  al^a2l  —  an2  0 

f 2  =  diag  [  82#  •  • .  8n|  , 

Making  these  substitutions  into  (P1) , 

0(RoGRjl)  U(Q0GQJ^-)  2 1 ^  1UIX 2 ^  1  where  U  =  V^UVj  a  un7tary  matrix.  8.5 

Proposition 

max  olljUfJ'*')  =  min 
U  7. 

S.t  gu*  =  I 

proof :  Appendix 

The  formal  problem  (P)  can  be  interpreted  in  terms  of  this  proposition.  The  best 
possible  output  margins  equal  the  minimum  of  the  ordered  ratio  of  the  singular  values 
of  R0GRfl  to  the  singular  values  of  Q0GQjl-.  When  the  uncertainty  at  the 
input  and  output  is  unstructured,  i.e.,  Rj,  R0,  Qi>  Q0  are  all  scalar  multiples 
of  the  identity,  the  maximum  achievable  margins  at  the  outputs  equal  1  --  the  same  as 
the  margins  at  the  inputs.  This  unstructured  case  causes  the  multivariable  problem  to 
be  analogous  to  the  single-loop  case.  When  the  uncertainties  do  have  some 
directionality  or  structure  (as  reflected  in  non-identity  R's  and  Q's),  the  feedback 
problem  is  inherently  multivariable.  This  directionality  may  impose  additional 
limitations  on  the  achievable  performance. 

In  the  Appendix,  it  is  shown  that  using  0  equal  to  the  identity  in  the  proposition 
achieves  the  maximum.  With  this  choice  of  U,  we  can  solve  for  K  from  (5.3)  as 

K  =  (Rjlv1V2*QI-I)'1G*1  5.6 

assuming  that  (J  =  X  yields  Rf1ViVr2*Ql_I  invertible.  As  far  as  actually 
achieving  this  limit  at  each  frequency,  note  that  and  V2*  need  not  be  rational 
functions  of  frequency.  Hence,  in  general,  we  cannot  achieve  this  limit  with  a  causal 
compensator  K(s).  The  question  of  the  limitations  imposed  on  the  output  margins  by 
both  fixed  input  margins  and  a  causal  compensator  is  still  open. 

In  concluding  this  analysis,  we  point  out  that  the  formal  problem  considered  here  is  not 
especially  important  in  and  of  itself.  One  could  pose  several  other  such  problems, 
e.g.,  maximizing  o(I+KG)  subject  to  constraints  on  I+GK,  etc.  The  reason  for  posing 
this  problem  is  that  it  illustrates  how  directionality  can  introduce  uniquely 
multivariable  phenomena. 

Recall  from  the  section  on  multivariable  nonminimum  phase  systems  that  a  rhp  zero  may 
cause  a  spread  in  singular  values.  This  may  cause  a  problem  similar  to  that  caused  by 
nonidentity  R  and  Q  weightings.  Thus,  non-minimum  phase  zeros  affect  multivariable 
feedback  design  in  two  ways.  First,  they  contribute  additional  phase  lag  in  certain 
input  and  output  directions.  This  limits  the  achievable  performance  in  these 
directions  and  is  analogous  to  non-minimum  phase  zeros  in  single-loop  systems.  The 
resulting  unequal  gains  in  different  directions  can  lead  to  a  conflict  between  input 
and  output  loop  properties.  This  second  effect  has  no  single-loop  analog. 

A  related  problem  arises  when  simultaneous  input  and  output  margins  are  considered. 

This  involves  letting  the  perturbed  plant  be  ( I-tAj)  G(  H-A2)  where  Aj  and 
A2  are  independent  perturbations.  The  companion  paper  (1)  has  an  example 
indicating  that  evaluating  margins  one  channel  at  a  time  does  not  give  a  reliable 
measure  of  robustness  with  respect  to  simultaneous  variations  in  multiple  channels. 

Thus  it  is  not  surprising  that  simple  examples  can  be  constructed  where  robustness  with 
respect  to  simultaneous  input  and  output  perturbations  is  significantly  worse  than 
robustness  for  either  point  considered  separately.  If  a  realistic  representation  of 
the  plant  uncertainty  involved  perturbations  at  both  inputs  and  outputs,  then  caution 
should  be  used  in  drawing  conclusions  based  on  analysis  for  either  point  individually. 


a . 

•ih* 


To  analyze  the  problem  of  simultaneous  perturbations  we  may  consider  a  unity  feedback 
system  with  nominal  transfer  function 


and  perturbed  transfer  function 


This  system  may  then  be  analyzed  for  robustness  with  respect  to  the  perturbation 


using  the  techniques  outlined  in  Section  2.  Unfortunately,  this  would,  in  general, 
give  a  conservative  answer  since  the  standard  singular  value  analysis  tools  do  not 
exploit  the  structure  of  the  perturbation  in  (5.9).  Again,  simple  examples  can  easily 
be  constructed  to  demonstrate  this. 

Recent  research  has  led  to  an  exact  solution  (i.e.,  not  conservative)  for  perturbations 
with  structure  such  as  in  (5.9).  These  results  are  as  yet  unpublished  and  are  too 
lengthy  for  inclusion  here.  They  provide  reliable  and  exact  techniques  for  analyzing 
simultaneous  independent  perturbations  and  generalized  the  ideas  that  are  presented  in 
this  paper.  The  point  that  must  be  emphasized  is  that  while  the  issues  presented  in 
this  paper  are  central  to  the  multivariable  control  problem,  the  results  should  not  be 
interpreted  too  broadly. 

6.  Conclusions 

This  paper  has  used  singular  value  analysis  techniques  to  study  some  limitations  on  the 
achievable  performance  of  multivariable  feedback  systems.  The  results  give  useful 
insight  into  the  fundamental  tradeoffs  that  must  be  faced  in  feedback  design.  It  is 
important,  though,  that  these  results  not  be  interpreted  too  broadly. 

In  particular,  the  development  in  this  paper  is  often  quite  qualitative.  While  there 
is  a  central  theme  of  feedback  as  a  means  of  dealing  with  uncertainty,  the  specific 
results  are  fragmented.  A  unified  framework  which  deals  naturally  with  the  algebraic 
and  functional  performance  limitations  would  be  highly  desirable.  In  addition,  the 
issue  of  robustness  with  respect  to  simultaneous  input  and  output  perturbations  needs 
further  study,  as  does  the  more  general  problem  of  analyzing  perturbations  with 
specific  structure.  Great  progress  is  being  made  in  these  research  areas. 

The  important  message  in  this  paper  is  that  feedback  design  inevitably  involves 
tradeoffs.  Analysis  and  design  techniques  must  make  these  tradeoffs  clear  to  the 
control  engineer  so  that  there  is  a  rational  basis  for  making  design  decisions. 
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APPENDIX 

Proposition:  Let  -  dia[a^  •  • .  ,onJ  ,  “i  ia?  -•  •  -  i°n  and 
:2  *  di«gU1,i?,...,En]  .  61  >  £2  I- •  Then 
max  *  min  [j^-) 
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Hence  the  norm  of  the  matrix  on  the  rhs  of  (A. 2)  is  one,  and  so 
—  **  -1 

Vj  1  r 


(A. 4) 


Equation  (A. 4)  is  tree  for  all  unitary  U  ,  with  equality  being  obtained 
for  II  *  I .  So 
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Summary 

This  paper  demonstrates  how  the  linear-quadratic-guassian  (LQG)  methodology  can  be 
used  to  design  feedback  compensators  which  meet  multivariable  performance,  stability, 
and  stability  robustness  requirements  expressed  as  singular  value  conditions  in  the 
frequency  domain. 


1 . 0  Introduct  ion 


For  more  than  two  decades  now,  feedback  control  design  has  been  viewed  from  two 
distinct  perspectives.  One  perspective  is  based  on  the  frequency  domain.  Its  roots 
lie  in  the  classical  single-input  single-output  (SISO)  design  methodologies  of  Bode  and 
Nyquist  (eg.  6,7),  and  its  philosophy  has  been  carried  over  to  various  attempted 
multivariable  generalizations  (eg.  9,  10,  11].  The  second  perspective  is  based  on  the 
time  domain.  Its  roots  lie  in  the  state  space  concept  for  system  descriptions  and  in 
1  inear-quadrat ic-gaussian  (LQG)  optimization  for  feedback  synthesis  [eg.  13,  14]. 

Despite  recognized  one-to-one  mathematical  relations  between  these  two 
perspectives,  feedback  design  has  been  taught  and  practiced  in  terms  of  one  perspective 
or  the  other.  Advocates  developed  for  each  side,  and  the  pros  and  cons  were  debated 
enthusiastically.  The  freqeuncy  domain  claimed  to  provide  a  natural  framework,  namely 
transfer  function  loop-shaping,  in  which  to  express  and  satisfy  practical  design 
requirements,  particularly  the  ever-present  requirement  for  uncertainty  tolerance. 

This  natural  framework,  however,  worked  best  for  SISO  systems  and  did  not  appear  to 
generalize  easily  to  multi-input  multi-output  (MIMO)  problems.  The  time  domain  claimed 
precisely  the  latter  capability,  i.e.,  its  design  techniques  handled  MIMO  systems  as 
well  as  time  variations  and  stochastics  with  efficient  computerized  algorithms.  Its 
weakness,  however,  was  the  lack  of  clear  relationships  between  practical  design 
requirements  and  the  postulated  linear-quadratic-gaussian  notion  of  optimality. 

Fortunately,  the  last  few  years  have  brought  a  synthesis  of  these  perspectives 
which  capitalizes  on  the  strengths  of  both  (1).  The  first  step  of  this  synthesis  was 
the  reliable  extension  of  classical  frequency  domain  loop-shaping  ideas  to  MIMO 
problems.  This  step  is  discussed  in  (2]  earlier  in  these  proceedings.  The  second  step 
of  the  synthesis  was  the  discovery  that  desirable  multivariable  loop-shapes  can  be 
produced  with  the  linear-quadratic-gaussian  methodology.  This  step  is  described  in 
this  paper.  We  begin  with  a  brief  review  of  the  multivariable  design  problem  in 
Section  2.  We  then  show  how  this  problem  can  be  addressed  with  the  LQG  methodology  in 
Section  3,  and  provide  a  simple  illustrative  example  in  Section  4.  The  bulk  of  this 
material  is  taken  directly  from  Reference  (1). 

2 . 0  The  Multivariable  Feedback  Design  Problem 

Our  objective  will  be  to  design  a  dynamic  compensator  for  the  generic  feedback  loop 
shown  in  Figure  1.  We  have  a  plant  (G)  with  input  vector  (u) ,  output  vector  (y) , 
disturbance  vector  (d)  as  seen  at  the  output,  and  measurement  noise  vector  (n) .  Our 
compensator  is  designated  as  K.  For  convenience,  K  is  showr  in  the  generic  forward 
path  location  from  command  vector  (r),  i.e.,  unity  feedback.  More  general  situations 
with  K  is  the  return  path  from  y  or  with  K  split  between  the  forward  and  return  paths 
are  equivalent  to  the  forward-path  location  if  appropriate  pre-filters  (P)  are  used  to 
shape  the  commands.  Since  P  does  not  affect  feedback  properties,  however,  we  will  not 
deal  with  its  design  issues  here. 
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Figure  1. 
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The  plant  G  is  a  ceal  physical  system.  Hence,  its  detailed  behavior  must  be 
expected  to  be  quite  compl icated--nonl inear ,  infinite  dimensional,  time-varying,  etc. 
Nevertheless,  we  will  assume  that  G  can  be  approximated  by  a  finite  dimensional, 
linear,  time  invariant  (FDLTI)  model  with  transfer  function,  G(s).  The  inevitable 
errors  which  this  model  makes  will  be  represented  by  unstructured  multiplicative 
perturbations  of  the  form  [2] 

G*(s)  =  [I  +  L(s)  )G(s)  .  (1) 

Here  G'(s)  is  a  possible  "true"  system,  and  L(s)  is  our  FDLTI  system  with  known  bounded 
maximum  gain,  tm(s).  Any  L(s)  which  does  not  exceed  this  maximum  gain  generates 
another  potential  "true"  system.  Hence,  equation  (1)  actually  describes  an  entire 
family*  of  plants  for  which  our  compensator  must  be  designed. 

The  design  problem  will  be  to  provide  good  command-following  (i.e.,  small  errors, 
e=y-r,  in  response  to  commands),  good  disturbance  rejection  (i.e.,  small  errors  in 
response  to  disturbance  excitation),  and  small  responses  to  sensor  noise,  all  subject 
to  the  constraints  imposed  by  modelling  errors  in  (1).  In  an  earlier  paper  in  these 
proceedings  [2),  Doyle  shows  that  these  various  design  objectives  can  be  interpreted  as 
conditions  on  the  loop  transfer  function  matrices  GK(s),  evaluated  at  real  frequencies 
s  =  ju).  We  need  large  loop  transfer  matrices  in  the  frequency  range  where  commands 
and/or  disturbances  are  large,  small  loop  transfer  matrices  in  the  frequency  range 
where  modelling  errors  are  large,  and  well-behaved  crossover  regions  in  between. 
"Well-behaved"  crossovers  are  those  which  achieve  nominal  stability  and  maintain 
reasonably  large  return  difference  matrices,  I+GK(jw),  and  inverse  return  difference 
matrices,  1+ (GK ( jw) ) “l.  These  frequency  domain  requirements  of  multivariable 
feedback  design  are  summarized  in  Figure  2. 


Classical  feedback  designers  will  recognize  that  the  frequency  domain 
interpretations  above  are  completely  analogous  to  the  classical  single-input 
single-output  design  methodology  (References  6,  and  7  for  example).  The  only 
distinction,  in  fact,  is  that  different  measures  are  used  to  judge  when  transfer 
functions  are  "large"  or  "small."  For  scalar  transfer  functions,  the  absolute  value 
serves  as  a  single  measure  of  size.  The  scalar  gk  is  large  if  igki  >>1,  and  it 
is  small  if  Igki  <<  1.  For  multivariable  systems,  on  the  other  hand,  two 
singular  values  (or  principal  gains)  serve  as  measures  of  size.  The  matrix,  GK,  is 
large  if  its  smallest  singular  value  (smallest  gain)  is  large,  i.e.,  if  c[GK)  >> 

1_.  It  is  small  if  its  largest  singular  value  (largest  gain)  is  small,  i.e., 
o(GK) <<  1.  More  discussion  of  singular  values  and  their  interpretations  are  given 
in  (1),  (2)  and  (8).  The  key  point  to  note  here  is  that  singular  values  reduce  the 
multivariable  feedback  design  problem,  in  concept,  to  the  familiar  frequency  domain 
loop-shaping  problem  of  classical  control. 


*For  deep  technical  reasonsdiscussed  in  (3),  the  number  of  unstable  poles  of  G' (s)  must 
be  known  for  all  members  of  this  family.  We  will  assume  that  this  number  is  constant 
and  equal  to  the  numbers  of  unstable  poles  of  G(s).  Note  that  each  G' (s)  in  the  family 
is  FDLTI.  This  restriction  can  be  removed  via  "sector  theory"  (4,  5). 


5-3 


3 . 0  Singular-Value  Loop-Shaping 

The  discussions  in  (2]  concentrate  on  singular  values  as  analysis  tools,  i.e.,  as 
instruments  to  specify,  analyze  and  judge  the  quality  of  multivariable  feedback  loops. 
Little  is  said  about  their  use  as  synthesis  tools.  It  is  clear,  of  course,  that  once 
we  accept  these  instruments  for  analysis,  then  it  also  becomes  desirable  to  have  design 
techniques  which  directly  manipulate  them  for  synthesis.  Ideally,  we  seek  design 
techniques  which  take  plant  descriptions  and  performance  specifications  as  inputs  and 
automatically  produce  compensators  as  outputs  which  satisfy  the  various  singular  value 
requirements  of  Figure  2. 

At  today's  state-of-the-art,  there  are  no  design  techniques  which  fully  satisfy 
these  synthesis  desires.  There  are  a  number  of  techniques,  however,  which  offer 
multivariable  design  capability  and  which  may  be  compared  against  the  singular  value 
loop-shaping  interpretation.  One  major  group  of  techniques,  for  example,  consists  of 
frequency  domain  methods  which  approximate  the  multivariable  problem  by  a  sequence  of 
scalar  problems.  This  is  done  by  constructing  a  set  of  scalar  design  functions  which 
may  be  manipulated  more  or  less  independently  with  classical  techniques.  The  ad  hoc 
"single-loop-at-a-time"  technique  in  common  engineering  practice  (9),  for  instance, 
uses  a  predefined  sequence  of  inner,  outer,  and  crossfeed  loops  to  accomplish  the 
design.  The  more  systematic  "Inverse  Nyquist  Array  (INA)“  procedure  [10]  uses  diagonal 
elements  of  a  pre-  and  post-  compensated  diagonally  dominant  transfer  matrix  as  design 
functions,  and  the  "Characteristic  Loci  (CL)“  methodology  discussed  in  [11]  and  earlier 
in  these  proceedings  [12]  uses  the  eigenvalues  of  the  transfer  matrix. 

Singular  value  loop-shaping  capabilities  of  these  various  procedures  are  examined 
in  (1)  and  [2].  They  turn  out  to  be  indirect  and  not  generally  reliable.  Elaboration 
of  this  judgment  will  be  left  to  the  references,  however,  and  we  will  not  study  these 
methods  further  here.  Rather,  we  will  explore  an  alternate  multivariable  design 
technique  --  the  Linear -Quadrat ic-Gaussian  (LQG)  methodology  [13,  14].  Although  this 
procedure  was  not  originally  designed  to  perform  direct  singular  value  loop-shaping,  it 
currently  appears  to  be  the  only  systematic  procedure  available  which  offers  this 
capability. 

The  LQG  Procedure 

As  is  well-known,  the  LQG  methodology  was  conceived  to  perform  control  system 
synthesis  on  the  basis  of  time  domain  optimization.  The  method  uses  the  following 
state  space  description  of  the  plant,  G: 

x  =  Ax  +  Bu  +  C  (2) 

y  -  Cx  +  n  (3) 

where  x  is  an  n-dimensional  state  vector,  u  and  y  are  m-dimensional  control  inputs  and 
r -dimensional  measurements,  respectively,  and  E  and  n  are  white  noise  processes. 

The  A,  B,  and  C  matrices  satisfy 

G(s)  =  C»<s)B 

d(s)  =  C*(s)  C  (  s) 

with 

*<s)  =  (si  -  A)'l.  (6) 


(4) 

(5) 


The  feedback  design  task  is  then  posed  as  an  optimization  problem  to  find  a  control  law 
u(t)  =  f | y ( t ;  t  <  t]  (7) 

to  minimize  the  following  performance  criterion* 

(zTz  +  p  uTu)dt)  (8) 


(9) 

m-dimensional  response  weighted  in  (8),  p  is  a  scalar  parameter, 
usual  mathematical  expectation. 

optimization  problem  is  given  by  [13] 

(10) 

where  Kc  is  a  full-state  linear-quadratic  regulator  (LQR)  gain  defined  by  the 
familiar  Riccati  equation 

0  =  PA  +  ATP  +  HtH  -  PB_i_BTP  (11) 

P 

Kc  *  -L.  BTP  (12) 

*This  criterion  represents  no  loss  of  generality  over  the  usual  form  (13)  with 
weighting  matrices  Q=QT^0  and  R=RT>0.  The  matrix  R  can  be  incorporated  in  B 
via  B'=BRl/2  and  HTH  is  the  most  general  form  necessary  for  Q  [15). 


with 


J  =  E  [Him 

T 


i  J 


(lx 


The  vector,  z,  is  an 
and  E { • }  denotes  the 

The  solution  of  this 

u*(t)  =  -Kcx ( t ) , 


(13) 


and  the  state  estimate  $(t)  is  given  by  the  Kalman  filter  (KBF) ; 
x  =  Ax  +  Bu  +  Kf [y (t)  -  Cx(t) 1 

whose  gain,  Kf,  is  defined  by  the  filter's  own  Riccati  equation 

0  *  AE  ♦  EAT  +  rrT  -  ZCTyCZ  (14) 

KF  =  1CT_1_  (15) 

Here  rrT  is  the  intensity  matrix  of  the  process  noise,  C,  and  ul  with  u 
scalar  is  the  intensity  matrix  of  the  sensor  noise,  n.*** 

It  follows  from  (10)  and  (13)  that  the  optimal  LOG  controller  is  a  cascaded 
combination  of  a  KBF  and  a  full-state  LQR.  For  our  purposes,  this  combination  can  be 
treated  as  an  ordinary  FDLTI  compensator: 

K(s)  *  Kc(sl  -  A  +  BKc  +  KfC)-lKf.  (16) 

This  compensator  has  the  special  internal  structure  shown  in  Figure  3.  In  terms  of  our 
previous  discussions,  the  functions  of  interest  in  Figure  3  are  the  loop  transfer, 
return  difference,  and  inverse  return  difference  functions 

GK,  Ir  +  GK,  Ir  +  (GK)-l,  (17) 

and  also  their  counterparts 

KG,  Im  +  KG,  Im  +  (KG)-l  (18) 

The  first  three  functions  measure  performance  and  stability  robustness  with  respect  to 
uncertainties  at  the  plant  outputs  (loop-breaking  point  (i)  in  Figure  3),  and  the 
second  three  measure  performance  and  robustness  with  respect  to  uncertainties  at  the 
plant  input  (loop-breaking  point  (ii)  in  Figure  3).  Both  points  are  generally 
significant  in  design. 

Two  other  loop-breaking  points,  ( i ) *  and  (ii)’,  are  also  shown  in  the  figure. 

These  are  internal  to  the  compensator  and  therefore  have  little  direct  significance. 
However,  they  have  desirable  loop  transfer  properties  which  can  be  related  to  the 
properties  of  points  (i)  and  (ii).  The  properties  and  connections  are  these: 

Fact  1  The  loop  transfer  function  obtained  by  breaking  the  LQG  loop  at  point  ( i ) * 
is  the  full-state  KBF  loop  transfer  function  C*Kf. 

Pact  2  The  loop  transfer  function  obtained  by  breaking  the  LQG  loop  at  point  (i) 
is  GK.  it  can  be  made  to  approach  C»Kf  pointwise  in  s  by  designing  the 
LQR  in  accordance  with  a  "sensitivity  recovery"  procedure  due  to  Kwakernaak 
(161  - 

Fact  3  The  loop  transfer  function  obtained  by  breaking  the  LQG  loop  at  point  (ii) ' 
is  the  full-state  LQR  loop  transfer  function  KC*B. 

Fact  4  The  loop  transfer  function  obtained  by  breaking  the  LQG  loop  at  point  (ii) 
is  KG.  It  can  be  made  to  approach  KC$B  pointwise  in  s  by  designing  the 
KBF  in  accordance  with  a  "robustness  recovery"  procedure  due  to  Doyle  and 
Stein  (17). 


COMPENSATOR  KIs) 


Figure  3. 


*  * "As  for  the  regulator,  there  is  no  loss  of  generality  in  these  selections  of 
statistics. 
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Facts  1  and  3  can  be  readily  verified  by  explicit  evaluation  of  the  transfer 
functions  involved.  Facts  2  and  4  take  more  elaboration  and  are  taken  up  in  a  later 
section.  They  also  require  more  assumptions.  Specifically,  G(s)  must  be  minimum  phase 
with  m>  r  for  Fact  2,  m<  r  for  Fact  4,  and  hence,  G(s)  musts  be  square  for  both. 

Also,  the  names  "sensitivity  recovery"  and  "robustness  recovery”  are  overly 
restrictive.  "Full-state  loop  transfer  recovery”  is  perhaps  a  better  name  for  both 
procedures,  with  the  distinction  that  one  applies  to  points  (i),  ( i ) '  and  the  other  to 
points  ( ii)  ,  (  ii)  1  . 

The  significance  of  these  four  facts  is  that  we  can  construct  LOG  loop  transfer 
functions  in  the  following  two-step  fashion: 

Step  1:  Design  a  full-state  feedback  law  with  desirable  singular  value  properties. 
This  is  easy  to  do,  as  we  will  see  shortly. 

Step  2:  Recover  (approximate)  this  full-state  loop  transfer  with  a  realizable  LOG 
control  law  using  one  of  the  recovery  procedures. 

In  order  to  apply  this  two-step  approach,  we  must  first  decide  which  loop-breaking 
point  to  design  for.  If  we  select  point  (i),  then  the  full  state  design  must  be  done 
with  the  KBF  design  equations  (i.e.,  its  Riccati  equation  (14)  -  (15)  and  recovery  with 
the  LQR  equations  (11)  -  (12).  If  we  select  point  (ii),  then  full-state  design  must  be 
done  with  the  LQR  equations  and  recovery  with  the  KBF.  The  mathematics  of  these  two 
options  are,  in  fact,  dual.  Hence,  we  will  describe  only  one  option  (for  point  (ii)) 
in  detail.  Results  for  the  other  are  merely  summarized  and  then  used  later  in  an 
example. 

Full-State  Loop  Transfer  Design 

The  intermediate  full-state  design  step  is  worthwhile  because  LOR  and  KBF  loops 
have  good  classical  properties  which  have  been  re-discovered  over  the  last  few  years 
[18]  -  (20].  The  basic  result  for  the  LOR  case  is  that  LQR  loop  transfer  matrices 

T  (  s )  =  Kc  4>  (  s)  B  (19) 

satisfy  the  following  return  difference  identity; 

(Im+  T(jw)|*  [Im+T(ju»]  =  Im+  [H<K  jo;)  B]  *  [H*  ( jui)  BJ  //» 

for  all  0  <  u>  <  «  (20) 

This  identity  can  be  derived  directly  from  the  LQR' s  Riccati  equation  [19].  Using  the 
following  definition  of  singular  values  (o)  in  terms  of  eigenvalues  (X); 

o2]M]  =  X(M*M] ,  (21) 

the  return  difference  identity  implies  that 

o|I,  +  T(jw)]  =  A  (1+  (H*B)*  H»B  ]/p' 

=  /!+  X  (  (H$B)  *  H«B ]/p 

=  /l+o2[H*(jo>)B]/p'  (22) 

This_expression  applies  to  all  singular  values  of  T(s)  and,  hence,  specificaly  to  n 
and  a.  It  governs  the  performance,  crossover,  and  stability  robustness  properties  of 
LQR  loops. 


Performance  Properties  --  Whenever  <j[T]>>1,  the  following  approximation  of  (22) 
shows  explicitly  how  the  parameters  p  and  H  influence  T(s): 

n(T(  jw)  ]  i  0[H»(jw)B]/Vp 


We  can  thus  choose  p  and  H  explicitly  to  satisfy  the  performance  requirements  in  Figure 
2  (high  gains  at  low  frequencies)  and  also  to  "balance"  the  multivariable  loop  such 
that  o [ T ]  and  o(T)  are  reasonably  close  together.  This  second  objective  is 
consistent  with  our  assumption  in  Section  2  that  the  transfer  function  G(s)  has  been 
scaled  and/or  transformed  such  that  the  uncertainty  bound,  tro(u>)  ,  applies  more  or 
less  uniformly  in  all  directions.  We  also  note  that  it  may  be  necessary  to  append 
additional  dynamics  in  order  to  meet  the  performance  requirements.  To  achieve  zero 
steady  state  errors,  for  example,  a[H$B]  must  tend  to  infinity  as  u>-f0.  This 
requires  m  free  integrations  in  the  plant  which  must  be  added  if  none  exist  to  start 
with. 


Crossover  Properties  --  Under  mild  assumption  on  A,  B,  C  and  H,  the  LQG  regulator  is 
nominally  stable  ( 1 3  J  .  It  also  follows  from  (22)  that  its  return  difference  always 
exceeds  unity:  i.e., 

n\ Im  +  T(jw)]^  1  for  all  0  <  u)  <  “  (24) 
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This  further  implies  1 2 1 1  that 

c  [  Im  *  T" 1  (jiu)|  ^1/2  for  all  0  <_  in  <  °°  (25) 

Hence,  LOR  loops  automatically  exhibit  well-behaved  crossovers. 

Robustness  Properties  --  It  follows  from  (25)  that  LQR  loops  are  guaranteed  to  remain 
stable  for  all  unstructured  uncertainties  (reflected  to  the  input)  which  satisfy 
tm( “) <0.5.  Without  further  knowledge  of  the  types  of  uncertainties  present  in 
the  plant,  this  bound  is  the  greatest  robustness  guarantee  which  can  be  ascribed  to  the 
regulator.*  While  it  is  reassuring  to  have  a  guarantee  at  all,  the  £m  <  0.5 
bound  is  clearly  inadequate  for  the  uncertainty  tolerance  requirements  of  Figure  2  with 
realistic  lm(iD)  's.  In  order  to  satisfy  these  requirements  in  LQR  designs, 
therefore,  it  becomes  necessary  to  directly  manipulate  the  high-frequency  behavior  of 
T ( s ) .  This  behavior  can  be  derived  from  known  asymptotic  properties  of  the  regulator 
as  the  scalar  p  tends  to  zero  (15,  16,  22,  23,  24).  The  result  needed  here  is  that 
under  minimum  phase  assumptions  on  H4>B,  the  LQR  gains  Kc  behave  asymptotically  as 
116) 

Sp  Kc=>  WH  (26) 

where  w  is  an  orthonormal  matrix.  The  LQR  loop  transfer  function,  T(s),  evaluated  at 
high  frequencies,  s=jc /^P~ with  c  constant,  is  then  given  by** 

T(j  c//p)  =  </p  Kc  ( jcI-2^A) -1B  =$,  WHB/jc.  (27) 

Since  crossovers  occur  at  <r ^  ( T )  =  1 ,  this  means  that  the  maximum  (asymptotic) 
crossover  frequency  of  the  loop  is 

“cmax  =  o(HB) //p.  (28) 

As  shown  in  Figure  2,  this  frequency  cannot  fall  much  beyond  id £,  where 
unstructured  uncertainty  magnitudes,  im,  approach  unity.  Hence,  our  choice  of  H 
and  p  to  achieve  the  performance  objectives  via  (23)  are  constrained  by  the  stability 
robustness  requirement  via  (28). 

Note  also  from  (27)  that  the  asymptotic  loop  transfer  function  in  the  vicinity  of 
crossover  is  proportional  to  1/ui  (-1  slope  on  log-log  plots) .  This  is  a  relatively 
slow  attenuation  rate  which  is  the  price  the  regulator  pays  for  its  excellent  return 
difference  properties  (24).  If  f.^1  (id)  attenuates  faster  than  this  rate, 
further  reduction  of  ujc  may  be  required.  It  is  also  true,  of  course,  that  no 
physical  system  can  actually  maintain  a  l/oo  characteristic  indefinitely  (7).  This  is 
not  a  concern  here  since  T(s)  is  a  nominal  (design)  function  only  and  will  later  be 
approximated  by  one  of  the  full-state  loop  transfer  recovery  procedures. 

Full-State  Loop  Transfer  Recovery 

As  described  earlier,  the  full-state  loop  transfer  function  designed  above  for 
point  (ii)'  can  be  recovered  at  point  (ii)  by  a  modified  KBF  design  procedure.  The 
required  assumptions  are  that  r  >  m  and  that  C4>B  is  minimum  phase.  The  procedure 
then  consists  of  two  steps: 

(i)  Append  additional  dummy  columns  to  B  and  zero  row  to  Kc  to  make  C^B  and 
Kf.'t'B  square  (r  x  r).  CtB  must  remain  minimum  phase. 

(ii)  Design  the  KBF  with  modified  noise  intensity  matrices, 

E(r,fT)  =  (rrT  +  qBBT]  8  (t  -  t) 

where  ITt  is  the  nominal  noise  intensity  matrix  obtained  from  stochastic 
models  of  the  plant  and  q  is  a  scalar  parameter  which  will  be  allowed  to  take 
on  a  sequence  of  increasingly  larger  values. 

Under  these  conditions,  it  is  known  that  the  filter  gains  Kf  have  the  following 
asymptotic  behavior  as  q  =?•  °°  (17): 

Kf//cT=£>BW  (29) 

Here  W  is  another  orthornormal  matrix,  as  in  (26).  When  this  Kf  is  used  in  the  loop 
transfer  expression  for  point  (ii),  we  get  point-wise  loop  transfer  recovery  as  q 
I.e., 

K  (  s )  G  (  s )  =  Kc($-1  +  BKC  +  KfC|*1KfC4'B 

=  Kc($  -  $K  f ( I r  *  C*Kf)-l  C$)  KfC^B 

=  Kc{  K f ( I r  +  C?Kf)-lc$B 

‘The  l^O .  5  bound  turns  out  to  be  tight  for  pure  gain  changes;  i.e.,  0. 5=^-6  db, 
which  is  identical  to  regulator's  celebrated  guaranteed  gain  margin  (20).  The  bound  is 
conservative  if  the  uncertainties  are  known  to  be  pure  phase  changes,  i.e.,  0.5  30 

deg,  which  is  less  than  the  known  +  60  deg  guarantee  (20). 

••This  spec  t ic  limiting  process  is  appropriate  for  the  so-called  generic  case  (22) 
with  full  rank  HB.  More  general  versions  of  (28)  with  rank  (HB)<  m  are  derived  in 
Reference  I  25  I . 


(30) 

(31) 

(32) 
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=*>KC$  B(C$B)-1  CtB  (33) 

=  Kc$B(Ir  +  KC4B) -l(C*B(Ir  +  Kc  tB)-l)-l  C4B  (34) 

=  {Kc*B(C*B)'MC4>B  (35) 

»  KC»B  (36) 

A 

In  this  series  of  expressions,  ♦  was  used  to  represent  the  matrix  (sln  -  A  + 

BKc)"!,  equation  (29)  was  used  to  get  from  (32)  to  (33),  and  the  identity  4B  = 

♦B(I+KC  4B)~1  was  used  to  get  from  there  to  (34).  The  final  step  shows 


explicitly  that  the  asymptotic  compensator  K(s)  (the  bracketed  term  in  (35))  inverts 
the  nominal  plant  (from  the  left)  and  substitutes  the  desired  LQR  dynamics.  The  need 
for  minimum  phase  is  thus  clear,  and  it  is  also  evident  that  the  entire  recovery 
procedure  is  only  appropriate  as  long  as  the  target  LQR  dynamics  satisfy  Figure  2’s 
constraints  (i.e.,  as  long  as  we  do  not  attempt  inversion  in  frequency  ranges  where 
uncertainties  do  not  permit  it).  Closer  inspection  of  (30)  -  (36)  further  shows  that 
there  is  no  dependence  on  LQR  or  KBF  optimality  of  the  gains  Kc  or  Kf.  The 
procedure  requires  only  that  Kf  be  stabilizing  and  have  the  asymptotic  characteristic 
(29).  Thus,  more  general  state  feedback  laws  can  be  recovered  (e.g.,  pole  placement), 
and  more  general  filters  can  be  used  for  the  process  (e.g.,  observers.)*. 

Dual  Results 


As  indicated  earlier,  the  full-state  design  and  recovery  procedures  for  points  (i) 
and  (ii)  in  Figure  3  are  mathematical  duals.  This  means  that  the  design  equations 
which  were  developed  for  point  (ii)  above  can  be  transformed  to  point  (i)  by  symbol 
substitutions  and  transpositions.  The  major  results  and  corresponding  equations  are 


the  following. 

Full-State  KBF  Design: 

T(s)  =  C  *  (s)  K f  (19)  ’ 

( Ir+T)  ( Ir+T)  *  =  Ir  +  (C*n  (COT)  */g;  for  s  =  jw,  0  <  u>  <  »  (20)  ’ 

a  [Ir  +  T]  =  /l  +o2  (c  4  r)/u  >1  (22)’  (24)  1 

o(T)  ;  a  (C  »  n//g;  for  o[T]  >>  1  (23)' 

/u"Kf  =£■  wr  as  U  =r>  0  (26)' 

“cmax  =  a[Cr|//IT  (28)' 


Note  that  the  "weight"  selection  process  to  achieve  good  singular  values  now  focuses  on 
the  matrix  T  and  the  scalar  u  instead  of  H  and  p. 

Full-State  Loop  Transfer  Recovery  Steps: 

(i)  Assume  m  >  r.  Append  dummy  rows  to  C  and  Kf  such  that  C  *  Kf  and  C  *  B 
are  square  (mxm)  and  C  *  B  is  minimum  phase. 

(ii)  Design  the  LQR  with  modified  weighting  matrices: 

Q  =  HtH  +  q  CTC 

Then,  as  q  =r-  “,  we  get 

Kc/^=-,WC  (29)' 

and 

G ( s ) K (s)  =4>  C*B  ((C»B)_1  C4Kf } 

=  C»Kf 

Note  that  asymptotic  inversion  now  occurs  from  the  right  and  that  it  is  again 
appropriate  only  as  long  as  the  target  KBF  dynamics  satisfy  Figure  2’s  constraints. 

4 . 0  An  Example 

The  full-state  design  and  full-state  recovery  processes  described  above  together 
provide  a  systematic  way  to  shape  the  singular  value  plots  of  multivariable  feedback 
loops.  This  is  illustrated  by  the  following  abstracted  longitudinal  control  design 
example  for  a  CH-47  tandem  rotor  helicopter.  Our  objective  is  to  control  two  measured 
outputs  --  vertical  velocity  and  pitch  attitude  --  by  manipulating  collective  and 
differential  collective  rotor  thrust  commands.  A  nominal  model  for  the  dynamics 
relating  these  variables  at  40  knot  airspeed  is  (26) 


(35)  ' 

(36)  ’ 


•Still  more  generally,  the  modified  KBF  procedure  will  actually  recover  full-state 
feedback  loop  transfer  functions  at  any  point,  u',  in  the  system  for  which  C  ♦  B'  is 
minimum  phase  (17). 
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where  the  state  vector  x  consists  of  forward  velocity,  vertical  velocity,  pitch  rate, 
and  pitch  attitude.  The  measurements  are  vertical  velocity  in  ft/sec  and  attitude  in 
degrees.  Major  unstructured  uncertainties  associated  with  this  model  are  due  to 
neglected  rotor  dynamics  and  unmodeled  rate  limit  nonlinearities.  These  are  discussed 
at  greater  length  in  [27|.  For  our  present  purposes,  it  suffices  to  note  that  they  are 
uniform  in  both  control  channels  and  that  £m(o))  >1  for  all  io>_10  rad/sec. 

Hence,  the  controller  bandwidth  should  be  constrained  as  in  Figure  2  to  Wcmax  1  10. 

Since  our  objective  is  to  control  two  measured  outputs  at  point  (i),  the  LQG 
design  iterations  utilize  the  dual  equations  (19)  '-(36)  '.  They  begin  with  a  full  state 
KBF  design  whose  noise  intensity  matrices,  E  ( FF,T)  =  (TTf>(  t-r)  and  E(nnT)  = 
ul8(t-i),  are  selected  to  meet  performance  objectives  at  low  frequencies;  i.e., 

0  |T|  ;  o  (C  $n//7  >  ps(0J>,  (37) 

while  satisfy:ng  stability  robustness  constraints  at  high  frequencies; 

“cmax  =  !cr]//[T  <  10  r/sec  (38) 

For  the  choice  r  =  B,  equation  (38)  constrains  u  to  be  greater  than  or  equal  to 
unity.*  The  resulting  KBF  loop  transfer  for  u  =  1  is  shown  in  Figure  4.  For 
purposes  of  illustration,  this  function  will  be  considered  to  have  adequate  high  gain 
properties  for  performance  with  low  gains  beyond  w  =  10  for  stability  robustness.** 

It  then  remains  to  recover  this  function  by  means  of  the  full-state  recovery  procedure 
for  point  (i).  This  calls  for  LQR  design  with  Q  =  HTH  +  q  CTC  and  R  =pl.  Letting 
H  =  0,  o=l,  the  resulting  LQG  transfer  functions  for  several  values  of  q  are  also 
shown  in  Figure  4.  They  clearly  display  the  pointwise  convergence  properties  of  the 
procedure. 


<0  10  100 
Fftf QUfNCV  n/S 


Figure  4. 

*If  CT  (or  HB)  is  singular,  equations  (38)  o:  (28)  are  still  valid  in  the  non-zero 
directions. 

**The  function  should  not  be  considered  final,  or  course.  Better  balance  between  o 
and  o  and  greater  gain  at  low  frequencies  via  appended  integrators  would  be  desirable 
in  a  serious  design. 
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5 . 0  Conclusions 


We  have  shown  how  the  modern  time  domain  LQG  methodology  can  be  used  to  design 
multivariable  feedback  controllers  which  satisfy  practical  design  requirements 
expressed  as  singular  value  conditions  in  the  frequency  domain.  The  design  process  was 
broken  into  two  basic  steps.  First,  a  full-state  LQR  (or  KBF)  feedback  law  is  designed 
whose  singular  value  properties  meet  design  objectives  --  high  gains  where  needed  for 
performance,  low  gains  where  needed  for  uncertainty  tolerance,  and  well-behaved 
corssovers  in  between.  Quadratic  weights  for  this  step  are  determined  explicitly  by 
known  relationships  between  the  weights  and  resulting  full-state  controller's  singular 
value  properties.  In  the  second  step,  this  full-state  loop  transfer  matrix  is  then 
approximated  with  a  realizable  LQG  compensator  using  a  loop  transfer  recovery 
procedure.  This  procedure  involves  the  traditional  KBF  design  (or  LQR  design, 
respectively),  but  with  modified  noise  statistics.  The  modifications  cause  the  LQG 
loop  transfer  matrix  to  approach  the  full-state  matrix  arbitrarily  closely  at  fixed  s. 
The  overall  design  process  was  illustrated  with  a  simple  example  which  shows  it  to  be  a 
direct  and  convenient  way  to  manipulate  multivariable  singular  value  properties. 
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Summary 

This  paper  describes  the  basic  design  algorithms  needed  for  frequency  domain  oriented 
Linear-Quadratic-Gaussian  feedback  design  and  introduces  an  experimental  interactive 
computer  aided  design  package  through  which  these  algorithms  can  be  effectively 
acessed.  The  algorithms  and  design  package  are  illustrated  with  several  flight 
control  design  examples  for  highly  maneuverable  aircraft. 


1.0  INTRODUCTION 

In  an  earlier  paper  in  these  proceedings  [11,  a  procedure  was  discussed  by  which  the 
Linear-Quadratic-Gaussian  (LQG)  synthesis  methodology  [2)  can  be  used  for 
frequency-domain  oriented  multivariable  control  system  design.  This  procedure  is  based 
on  the  interpretation  of  various  design  requirements  as  specifications  on  the  singular 
value  plots  of  the  design's  loop  transfer  function  matrices.  The  resulting  desired 
loop  shapes  are  then  achieved  with  a  two-step  LQG  process: 

Step  1:  Full-state  design  with  quadratic  weights  or  noise  statistics  selected  to 
satisfy  the  singular  value  specifications. 

Step  2:  Full-state  loop  transfer  recovery  with  modified  filter  or  regulator  design 
parameters. 

Except  for  the  most  elementary  design  problems,  these  steps  cannot  be  carried  out 
effectively  with  manual  calculations.  Computerized  design  assistance  is  essential, 
either  in  the  traditional  batch-processing  mode  or,  preferably,  in  an  automated 
interactive  design  environment. 

This  paper  discusses  some  of  the  basic  elements  of  a  computerized  LQG  design  process. 

It  begins  with  a  brief  review  of  the  LQG  methodology.  The  core  algorithms  needed  for 
LQG  synthesis  are  then  discussed,  and  a  prototype  command  structure  is  introduced  for 
their  interactive  computer  implementation.  This  command  structure  is  currently  hosted 
on  a  Honeywell  Multics  system  and  is  being  used  for  various  interactive  control  design 
experiments.  The  structure  is  illustrated  with  several  aircraft  flight  control  design 
examples. 

2.0  REVIEW  OF  THE  LQG  LOOP  SHAPING  PROCESS 

As  in  [1]  and  [3],  we  will  deal  with  the  generic  feedback  loop  shown  in  Figure  1.  It 
consists  of  the  interconnected  plant  (G)  and  compensator  (K)  forced  by  commands  (r), 
measurement  noise  ( n) ,  and  by  a  set  of  disturbances  (d)  which  are  assumed  to  be 
reflected  to  the  measured  output  (y).  The  dashed  precompensator  (P)  is  an  optional 
element  used  to  achieve  deliberate  command  shaping  or  to  represent  a  non-unity  feedback 
system  in  equivalent  unity  feedback  form.  All  signals  are  taken  to  be  multivariable, 
in  general,  and  both  nominal  mathematical  models  of  G  and  K  are  finite  dimensional 
linear  t ime- i nvar iant  systems  with  transfer  function  matrices  G(s)  and  K(s). 


d 


Figure  1.  Standard  Feedback  Configuration 
Basic  Design  Requirements 

As  discussed  in  [3],  the  feedback  system  in  Figure  1  has  good  performance  properties 
whenever  the  inverse  return  difference  operator  (I+GK)"1  is  "small".  For  scalar 
transfer  functions,  the  appropriate  notion  of  smallness  can  be  captured  by  absolute 
values;  i.e.,  we  typically  require  that  1/ 1 1  +g  ( jui)  k  ( jw)|  be  small  compared  to  unity 


in  a  frequency  range  where  commands  and/or  disturbances  are  large.  This  is  typically 
the  low  frequency  range,  say  ui  <  w0.  In  the  multivariable  case,  an  analogous 
notion  of  smallness  has  been  defined  based  on  singular  values  [3].  We  require  that  the 
maximum  singular  value  of  (I+GK)"1-,  denoted  by  o  [  ( I+GK  (  ju>)  )  " 1  ]  ,  be  small 
compared  to  unity  over  the  designated  frequency  range.  The  converse  of  this 
requirement  is  that  the  return  difference  operator  I+GK  be  "large”,  i.e., 

o  [I+GK(jn)))  >  ps(w)  >>  1 

for  all  u  >  ug,  (1) 

where  ps(u>)  is  a  scalar  function  typically  available  as  an  explicit  frequency  domain 
performance  specification.  With  ps(w)  sufficiently  large,  this  expression  simply 
states  that  the  loop  gains  o(GK(jw)|  must  themselves  be  large.  Evidently,  good 
feedback  design  boils  down  to  achieving  high  loop  gains  in  the  necessary  frequency 
range. 

Of  course,  this  latter  statement  expresses  only  half  of  the  feedback  design  problem. 

In  real  physical  feedback  systems,  loop  gains  can  never  be  made  arbitrarily  high  over 
arbitrarily  wide  frequency  ranges.  Rather,  they  are  constrained  by  our  desire  to 
suppress  sensor  noise  responses  and  also  by  the  requirement  to  remain  stable  in  the 
presence  of  model  uncertainties. 

The  second  source  of  limitations  is  commonly  the  more  severe.  The  feedback  system  is 
required  to  operate  successfully  in  the  presence  of  an  infinite  variety  of  differences 
between  the  actual  plant  and  its  design  model  G(s).  A  very  useful  way  to  express  this 
variety  of  possible  differences  is  in  the  so-called  unstructured  multiplicative  form: 

G'  (s)  =  [  I+L ( s) 1  G ( s)  (2) 

with  o"[ L ( j w)  1  < 

Here  the  actual  plant  G'  is  represented  by  its  nominal  model,  G(s),  modified  by  a 
multiplicative  perturbation,  L(s),  whose  characteristics  are  known  only  by  the 
magnitude  bound  tm(oo)  .  Typically,  this  bound  is  small  (tm<<l)  at  low 
frequencies,  but  grows  with  increasing  frequencies  and  eventually  exceeds  unity  at 
frequencies  u  > 

As  shown  in  (31,  a  nominally  stable  feedback  system  will  remain  stable  in  the  face  of 
unstructured  uncertainties  of  the  type  (2)  if  and  only  if 

o  ( 1+  (GK) -1)  >  (3) 

for  all  0  <  w  <“ 

This  equation  is  a  multivariable  generalization  of  the  familiar  single  loop  requirement 
that  loop  gains  be  small  whenever  uncertainties  are  large.  In  fact,  whenever 
>>1,  we  get  the  following  constraint  on  GK: 

”[GK(  ju»  ]  <  l/im(oo)  <<  1  (4) 

for  all  uj  such  that  (.m(u>)  <<  1 

Based  upon  (1)  -  (4),  it  follows  that  the  fundamental  feedback  design  problem  becomes 
one  of  finding  a  compensator  K(s)  which  shapes  the  loop  transfer  matrix  GK(s)  in  such  a 
way  that 

a)  the  loop  gains  o(GK|  are  high  at  low  frequencies  to  meet  performance 
requirements  ( 1)  , 

b)  the  loop  gains  o[GK]  are  low  at  high  frequencies  to  meet  the  stability 
robustness  requirements  (4),  and 

c)  the  transitions,  or  "crossovers",  between  these  two  regions  are 
"well-behaved".  This  means  that  they  are  stable  (i.e.,  det(I+GK)  satisfies 
the  multivariable  Nyquist  Criterion  (4))  and  that  I+GK  and  I+(GK)"1  are  not 
unduly  small. 

LQG  Loop-Shaping 

Although  the  L inea r -Quadra t ic-Gauss ian  methodology  was  developed  purely  from  a  time 
domain  optimization  perspective,  we  saw  in  (1)  that  it  proved  to  be  remarkably 
effective  as  a  tool  for  the  frequency-domain  loop  shaping  task  outlined  above.  In 
order  to  briefly  review  the  results  of  (1),  we  will  deal  with  the  standard  LOG 
controller  configuration  shown  in  Figure  2.  It  consists  of  the  well-known  cascaded 
combination  of  a  Kalman-Bucy  filter  (KBF)  with  a  full-state  1  inear-quadrat ic  regulator 
(LOR).  Beyond  this  special  internal  structure,  the  combination  comprises  an  ordinary 
linear  finite  dimensional  compensator  with  transfer  matrix 

K ( s )  =  Kc(sI-A+BKc+KfC)-lKf  (5) 

where  the  standard  matrix  triple  (A,  B,  C)  has  been  used  to  represent  a  state  space 
realization  of  the  plant,  and  the  symbols  Kc  and  Kf  are  used  to  represent  the  LOR 
and  KBF  gain  matrices,  respectively,  as  defined  by  their  individual  algebraic  Riccati 
equations  ( 1 | : 
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PA  +  AtP  +  Q  - 

PBtR_1BP 

(6) 

ZCN*1 

AZ  +  ZAt  +  M  - 

ZCN'icTZ 

(7) 

The  loop  transfer  matrices  of  the  LQG  controller  are  then  either  G(s)K(s),  for  the  loop 
broken  at  point  (i)  in  Figure  2,  or  K(s)G(s),  for  the  loop  broken  at  point  f i i ) .  In 
general,  these  functions  have  no  special  properties  except  nominal  closed  loop 
stability.  However,  if  the  plant  G(s)  is  square  and  minimum  phase,  and  if  we  design 
the  LQR  with  the  following  weighting  matrices; 


Q  =  qC^C  (scalar  q  =$>  “)  (8) 

R  =  pi  (scalar  p  >  0  and  fixed) 


then  the  loop  transfer  function  GK(s)  will  approach  a  certain  full-state  loop  transfer 
function  arbitrarily  closely  as  q  0  ;  namely 

GK  ( s)  =>  C(sI-A)_1Kf  =  T  ( s)  (9) 

The  function  T(s)  turns  out  to  be  the  loop  transfer  function  of  the  Kalman-Bucy 
filter's  error  dynamics  and  can  be  readily  adjusted  to  exhibit  all  three  of  the 
loop-shaping  design  objectives  discussed  above.  In  particular,  if  we  let  the  filter's 
noise  statistics  be 

m  =  rrT 

N  -  ul,  (scalar  u  >0)  (10) 

then  T(s)  in  (9)  satisfies 

a)  0  (T  ( j(D)  ]  ;  a(C(joiI-A) -lr|//u"  (11) 

for  all  to  such  that  a(T)>>l 

b)  cj  (T  ( ju>)  J  =*■  ofcri/ai/iT  (12)* 

as  id  =^.  <«,  u  0  with  «/ g”  constant 

c)  a(I+T(j<u)  1  >  1  (13) 

and 

at  I+T-l< jw) 1  >  1/2  (14) 

for  all  o  <  id  <  • 


compensator  k(s) 


Figure  2.  LQG  Controller 

Note  that  property  a)  provides  a  direct  way  to  satisfy  our  high  gain  objectives  at  low 
frequencies.  The  gains  are  simply  given  by  the  noi se-to-output  transfer  function 
C  (si -a)  "1  r//iT,  with  r  and  u  at  our  discretion.  Similarly,  property  b) 
provides  a  way  to  meet  the  stability  robustness  requirement  at  high  frequencies.  It 
shows  that  the  loop  gains  become  small  at  an  asymptotic  rate  proportional  to  1/<d  and 
achieve  their  last  crossover  at  the  frequency 

“cmax  “  o(crj//( T.  (15) 

Finally,  property  c)  shows  that  all  crossovers  will  be  "well-behaved".  The  return 
difference  always  exceeds  unity,  the  inverse  return  difference  always  exceeds  0.5,  and 
of  course,  the  nominal  loop  is  guaranteed  to  be  stable. 

Based  on  the  above  summary  of  frequency  domain  properties  of  LQG  controllers,  the 
following  simple  loop-shaping  procedure  suggests  itself: 

Step  1.  Design  a  KBF  with  r  and  u  selected  such  that  the  loop  transfer  function 
T(s)  meets  performance  and  stability  robustness  requirements. 


This  equation  applies  to  the  "generic"  case  with  Cf  full  rank  |1). 
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Step  2.  Design  a  sequence  of  LQR’s  with  weights  selected  according  to  (6)  and  q 

allowed  to  take  on  consecutively  larger  values.  Then  select  an  element  of  the 
resulting  sequence  of  transfer  functions 

GK  ( s)  =^>  T  ( s ) 


which  adequately  "recovers”  the  desired  function  over  the  frequency  range  of 
interest.  All  design  objectives  including  nominal  stability  are  then  assured. 

As  discussed  in  (1|,  a  dual  version  of  this  process  is  also  available  for  shaping  the 
loop  transfer  function  KG(s)  at  point  (ii)  in  Figure  2.  For  this  point,  however,  the 
role  of  the  filter  and  controller  are  reversed.  We  begin  by  designing  an  LQR  whose 
loop  transfer  Kc(sI-A)*1b  has  good  frequency  domain  properties  and  then  design  a 
sequence  of  KBFs  which  serve  to  "recover"  this  function.  Equations  for  this  alternate 
procedure  are  mathematical  "duals"  of  the  results  given  above.  Both  procedures  are 
illustrated  in  the  design  examples  of  Section  4. 

3.0  COMPUTATIONAL  TOOLS  FOR  LQG  DESIGN 


As  noted  earlier,  the  design  steps  outlined  above  can  be  carried  out  manually  for  only 
the  most  elementary  problems.  Even  then,  the  calculations  are  likely  to  be  tedious. 
Hence,  serious  use  of  the  LQG  method  in  the  design  environment  of  industry  and  in  the 
teaching  environment  of  academe  requires  computerized  assistance.  The  basic  algorithms 
needed  for  this  purpose  and  a  prototype  interactive  command  structure  for  easy  user 
access  to  these  algorithms  are  discussed  below. 


Basic  Algorithms 


It  follows  from  Section  2  that  the  core  computing  tasks  of  our  frequency-domain- 
oriented  LQG  synthesis  process  include  the  following  operations: 

1.  Riccati  solutions  for  gains  Kc  and  Kj 

2.  Frequency  response  evaluat ionsf or  G(jui),  K(jui),  GK(jio),  etc. 

3.  Singular  value  decompositions 

FORTRAN  subroutines  for  these  operations  are  available  on  many  scientific  computing 
systems.  Riccati  equation  solvers,  for  example,  exist  in  several  software  packages 
[5-7].  They  generally  compute  solutions  for  equation  (6)  and  for  its  mathematical 
(7)  by  means  of  the  so-called  Potter  algorithm  [8].  This  algorithm  expresses  the 
solution  matrix,  p,  in  terms  of  a  full  eigenvalue/eigenvector  decomposition  of  the 
2n-dimensional  Hamiltonian  system  associated  with  the  LQ-optimization  problem  [9]. 
That  is,  let 


H  = 


A  -BR-1BT 
-Q  -AT 


=  w 


W-l 


10  A*  I 

where  H  is  the  Hamiltonian  system  matrix.  A”  =  diag(lj)  and  A+  = 

diag(>*)  are  its  stable  and  unstable  (mirror  image)  eigenvalues  respectively, 

and  where  W  denotes  the  corresponding  matrix  of  eigenvectors.  Then  P  is  defined  in 

terms  of  a  2x2  block  partition  of  W  as 


dual 


P  =  W2i  (WU)  -1 


(17) 


An  alternate  algorithm  is  available  due  to  Kleinman  [10]  which  utilizes  Newton 
iterations  to  solve  the  Riccati  equation.  It  requires  a  stabilizing  initial  guess  for 
K  or  P  and  then  iterates  as  follows: 


(i)  Let  K  =  R‘1BTP0 

(ii)  Solve  0  =  P^ (A-BK)  +  (A-BK)TP1  +  Q  +  KRKT  for  Pj  (18) 

(iii)  If  MPj^  -  p o 1  l  >  e  return  to  (i)  with  Pg  =  Pi 
Otherwise  terminate  with  P  =  Pi 


Still  more  recently,  an  algorithm  has  been  introduced  by  Laub  [11]  which  generalizes 
Potter's  method  by  using  a  real  Schur  form  in  place  of  the  full  eigenvalue/vector 
decomposition  to  isolate  the  stable  subspace  of  the  Hamiltonian  system.  In  the  real 
Schur  form,  the  matrix  analogous  to  A~  in  (16)  is  upper  triangular  with  real 
eigenvalues  on  the  diagonal  and  selected  nonzero  entries  on  the  first  subdiagonal 
corresponding  to  pairs  of  complex  conjugate  roots.  In  addition,  the  transformation  W 
is  real  and  orthonormal  (i.e.,  =  WT) .  This  offers  substantial  numerical 

advantages  and  makes  Laub's  algorithm  probably  the  most  convienent  and  numerically 
robust  Riccati  solver  available  at  this  time. 


Transfer  function  and  frequency  response  evaluations  are  also  standard  operations  in 
many  computing  systems.  Given  a  state  space  tripe  A,  B,  C  (or  more  generally,  a 
quadrouple  A,  B,  C,  D)  ,  its  transfer  function  at  fixed  frequency,  say  s  =  juij,  is 
defined  as 


G(jiuj)  =  C(ju)jl-A)'1  B  +  D 

I 


(19) 
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This  expression  is  usually  evaluated  in  two  steps.  First,  the  linear  equation  system 

(jwil-A)  X  »  B  (20) 

is  solved  for  X  via  Gaussian  elimination  (LU  factorization  followed  by  solution  of  a 
triangular  system  (121)  and  then  G  is  evaluated  as 

G  =  CX  +  D  (21) 

These  steps  require  approximately  (n3/3+  mn^  +  rmn)  multiply-add  operation  [13|, 
where  n  is  the  state  space  dimension  of  the  system  (i.e.,  n  =  dim  A) ,  m  is  the  number 

of  inputs,  and  r  is  the  number  of  outputs.  Note  that  this  operation  count  is 

proportional  to  n^  and  that  it  must  be  repeated  for  each  frequency  point  of 
interest.  Hence,  if  the  state  space  dimension  is  high  and  if  the  number  of  frequency 
samples  is  large,  frequency  response  evaluation  can  consume  considerable  computing 
time.  These  time  demands  can  be  reduced,  however,  by  utilizing  transformed  state  space 
representations.  For  example,  a  full  eigenvalue/vector  decomposition  of  matrix  A  can 
be  used  to  convert  the  system  A,  B,  C,  D  into  its  diagonal  equivalent 

diag(Xi)  ,  W^B,  CW,  D  (22) 

While  this  involves  a  one-time  overhead  of  O(n^)  operation,  all  subsequent 
evaluations  of  ( 20 ) - ( 2 1 )  require  only  (mn  +  rmn)  operations.  Hence,  the  total 
computing  time  can  be  reduced  considerably.  Unfortunately,  complete  eigenvalue/vector 
decompositions  are  often  plagued  with  numerical  i) 1-conditioning  ,  and  hence,  the  form 
(20)  is  not  always  reliably  available.  An  alternate  transformation,  which  avoids  this 
difficulty  and  still  achieves  significant  computer  savings  is  the  Hessenberg  form. 

This  form  is  upper  triangular  with  nonzero  entries  on  the  first  subdiagonal.  Like  the 
Schur  form,  it  is  achieved  with  real  orthogonal  transformations  using  a  one-time 
overhead  of  5n3/3  operations.  Its  structure  remains  intact  when  we  add  jujl,  so 
( 20 ) - ( 21)  can  be  solved  for  each  subsequent  frequency  point  with  approximately  (mn2/2  + 
rmn)  operations  (13).  Hence,  whenever  n  is  large  compared  with  m  and  full 
eigenvalue/vector  decompositions  are  unreliable,  the  Hessenberg  form  provides  an 
effective  alternative  transformation  procedure. 

The  third  basic  computing  task  for  LQG  design  involves  singular  value  decompositions, 
or  SVDs.  These  are  also  available  as  standard  routines  on  many  computing  systems.  They 
convert  the  rxm  matrix  G(jui)  into  the  following  factors: 

G(j<D)  =  U  t  VH  (23) 

where  U  and  V  are  (rxr)  and  (mxm)  unitary  matrices  (i.e.,  UHU  =  I,  VHV  =  I), 
respectively,  (  )"  denotes  complex  conjugate  transpose,  and  Z  is  an  rxm  matrix 

with  entries  only  along  the  main  diagonal.  These  entries  are 

°l  1  °2  1  1  •**  >  Op  >  o  (24) 

with  p  =  min(r,m).  The  two  extreme  values,  o^  and  Op,  correspond  to  the 
maximum  and  minimum  singular  values,  a  and  o.  Their  significance  with  respect  to 
LOG  design  has  already  been  discussed. 

The  computation  of  SVD's  relies  on  a  sequence  of  orthogonal  equivalence  transformations 
whose  details  are  described  in  (12],  This  same  reference  also  provides  FORTRAN 
listings  of  an  SVD  subroutine.  SVD's  consume  approximately  6r^m  multiply-add 
operations.  While  this  is  quite  large  compared  with  other  equal-size  matrix  operations 
(e.g.,  inversion,  eigenvalue/vector  decomposition),  the  dimensions  r  and  m  are  usually 
modest  and  other  calculation  costs  dominate  LQG  design. 

In  addition  to  these  three  basic  computing  algorithms  needed  for  LQG  synthesis,  an 
effective  computerized  design  environment  must  also  provide  software  facilities  for 
more  routine  matrix  manipulations  (eg.  sums,  products,  inverses,  etc.),  for 
input/output,  data  plotting,  and  for  other  data  management  functions.  An  example  of 
these  essential  support  capabilities  is  given  later  in  this  section. 

Interactive  Environments 

In  traditional  batch-processing  computer  systems,  basic  computing  algorithms  such  as 
the  ones  described  above  are  accessed  by  the  user  as  subroutines.  They  are  assembled 
into  a  desired  fixed  sequence  of  operations  by  a  user-written  main  program  which  also 
handles  routine  manipulations,  input/output,  data  management,  etc.  This  program  is 
submitted  to  the  computing  facility  where  it  is  compiled  and  executed.  Results  are 
then  returned  to  the  user  for  evaluations,  program  modifications,  and  re-submittal. 
While  this  process  is  a  substantial  improvement  over  manual  computations,  it  is  still 
slow,  prone  to  error,  and  frustrating  for  the  designer.  As  a  result,  considerable 
research  effort  and  hardware/sof tware  development  has  been  devoted  to  its  improvement. 

One  of  the  results  of  these  R&D  efforts  is  the  interactive  approach  to  computer-aided 
design.  In  this  approach,  the  designer  interacts  with  the  computer  directly,  flexibly, 
and  in  apparent  real  time.  He  is  usually  provided  with  the  following  facilities: 


a  graphics  terminal 

a  software  library  of  design  programs 

a  high-level  command  language 

editing  and  macro  command  generation  software. 

The  terminal  provides  real  time  computer  access  and  control  with  effective  graphic 
communication  of  design  status,  results  and  tradeoffs.  It  is  usually  augmented  with 
hardcopy  equipment  and  off-line  printing  facilities  to  provide  permanent  records  of  the 
communication.  The  subroutine  library  contains  all  design  algorithms  needed  for  a 
specified  class  of  design  problems  (eg.  Riccati  solvers,  SVD's,  etc.  for  LQG  design). 
The  command  language  provide  a  simple  means  to  invoke  these  high-level  routines  in  any 
user-desired  sequence  without  concerns  about  input/output,  data  handling,  internal  data 
structures  and  numerics.  A  command  language  should  be  contrasted  with  alternative 
"question  and  answer”  approaches  to  computer  interaction.  In  the  latter,  the  computer 
program  guides  the  user  through  a  pre-programmed  tree  of  fixed  design  sequences  with  a 
controlled  question  and  answer  dialogue.  This  approach  has  been  shown  to  lack  the 
flexibility  required  for  a  wide  spectrum  of  users  (14).  Finally,  the  editing  and  macro 
capability  provides  a  way  to  define,  save  and  execute  fixed  sequences  of  user-specified 
commands.  In  effect,  a  user  can  create  higher-level  commands  suitable  for  his  own 
design  task.  These  enhance  the  power  of  the  basic  command  language.  Macro  facilities 
have  been  found  to  be  a  key  element  in  the  success  of  interactive  design  [14). 

An  Illustrative  Interactive  System 

It  is  not  our  intent  here  to  discuss  the  various  issues  of  interactive  computer-aided 
design  in  detail.  This  is  done  in  [14].  Rather,  the  intent  is  to  illustrate  the  use 
of  interaction  as  it  might  apply  to  the  LQG  design  process.  This  will  be  done  by  means 
of  an  example  interactive  program  package,  HONEY-X,  which  has  been  constructed  on 
Honeywell's  Multics  computer  system  [15].  The  purpose  of  this  package  has  been  to 
support  multivariable  control  design  research  [16-17].  As  such,  the  package  is  still 
in  a  state  of  flux  and  somewhat  limited  in  scope.  It  does,  however,  provide  an 
effective  illustration  of  the  power  of  interactive  design. 

HONEY-X  consists  of  a  library  of  standard  ANSI  FORTRAN  design  programs  which  are 
controlled  and  accessed  through  the  command  and  file  management  facilities  offered  by 
Multics.  Multics  also  provides  editing  and  macro  command  generation  capabilities.  The 
package  is  organized  into  three  hierarchial  levels,  as  illustrated  in  Figure  3.  The 
levels  are: 

System  Executive--This  level  is  an  executive  program  written  in  the  Multics  Command 
Language.  It  is  responsible  for  establishing  and  maintaining  the  design  system 
environment.  Upon  entry  with  the  command  ”ec  matrix”,  it  makes  the  design  system 
library  available  to  the  user  by  designating  subprogram  locations  within  the 
Multics  file  system.  Next,  the  system  executive  solicts  a  command  line  from  the 
user  and  performs  first-level  parsing  to  determine  to  which  of  the  following  clases 
the  command  belongs: 

1.  HONEY-X  system  command 

2.  User  Macro  command 

3.  Multics  command 

The  command  is  then  passed  to  the  appropriate  processor  for  execution.  The  system 
executive  continues  to  solicit  commands  until  it  is  terminated  with  the  quit 
command  ”q". 

Program  Executive--This  level  consists  of  executive  programs  which  interface 
between  the  system  executive  and  the  requested  object  programs.  Its  primary 
responsibility  is  to  direct  input  to  and  output  from  the  file  system.  It  must 
perform  second-level  parsing  in  order  to  determine  which  files  should  be  attached. 
If  the  command  line  contains  optional  arguments,  these  are  passed  to  the  object 
program. 

Object  Program--This  level  performs  the  actual  computations  which  are  implied  by 
the  command  line.  It  must  check  for  compatability  of  input  data.  If  optional 
arguments  have  been  passed  by  the  program  executive,  it  must  parse  them  and 
determine  if  it  har  been  given  sufficient  information  to  proceed.  If  input 
parameters  are  missing  and  do  not  have  default  values,  they  must  be  solicited  from 
the  user.  Note  that  prompting  for  input  is  provided  only  if  the  required 
information  is  not  present.  This  allows  experienced  users  to  avoid  unnecessary 
input  menus  while  giving  unfamiliar  users  all  the  information  needed.  When 
applicable,  the  object  program  has  the  final  responsibility  of  directing  printed 
output  to  the  user's  terminal  I/O  device. 


Programs  in  HONEY-X's  library  were  selected  and  developed  in  the  course  of  several 
years  of  multivariable  design  research.  Programs  were  added  until  they  formed  a  set 
which  would  allow  most  manipulations  to  be  performed  by  a  straight-forward  combination 
of  two  or  three  commands.  At  that  point,  programs  with  significan'  overlap  in  function 
were  combined.  This  produced  two  basic  sets  of  routine  --  1)  the  "General  Matrix 
Routines"  which  perform  primitive  operations  on  matrices,  and  2)  the  "Special-Purpose 
Programs"  which  perform  more  involved  functions  of  linear  sytems  analysis  and 
synthesis.  The  special  purpose  programs  require  the  existence  of  specific  input 
matrices  that  have  been  named  according  to  the  established  naming  conventions. 
Similarily,  their  output  matrix  names  are  chosen  so  as  to  identify  the  contents.  Table 
1  provides  brief  description  of  both  types  of  routines. 
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TERMINAL  I/O 


Figure  3.  HONEY- X  Structure 
TABLE  X.  HONEY-X  PROGRAM  LIBRARY 
GENERAL  MATRIX  ROUTINES 

A .  Initialization 

martini  ( initialized,  dimensions,  entries) 
matsid  (scalar  times  identity) 

B.  Manipulative 
mataug  (augment) 

matchg  (change  dimensions,  entries) 
matsam  (frequency  sample) 
matsel  (select  rows  and  columns) 
addname  (Multics  system  routine) 
adds  new  file  names 
copy  (Multics  system  routine) 
copies  files 

delete  (Multics  system  routine) 
deletes  files 

rename  (Multics  system  routine) 
changes  file  names 

C.  Display 

matchg  (print  entries  greater  than  a  threshold) 
matplt  (plot) 
matprt  (print) 

matscn  (row  and  column  dimensions;  list  of  contained  frequencies) 

D.  Functions 

matcnj  (conjugate) 
matctp  (conjugate  transpose) 
matdia  (vector  =£■  diagonal) 
matimg  (imaginary  part) 
matinv  (inverse) 
matpol  (rectangular  =£>  polar) 
matrec  (polar  rectangular) 
matrel  (real  part) 
mattrp  (transpose) 

E.  Arithmetic  Operations 
matadd  (addition) 
matdiv  (division) 
matmpy  (multiplication) 

matsel  (multiplication  by  a  scalar) 
matsub  (subtraction) 
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F.  Coordinate  Transformation 

mateig/geneig  (eigenvalue/eigenvector) 

matnrm  (normalize  columns) 

matsvd  (singular  value  decomposition) 

II.  SPECIAL-PURPOSE  ROUTINES 

A .  Transfer  Function  Generation 

c isabd  (from  state-space  quadruple) 
leadlag  (lead/lag  compensator) 
notch  (notch/inverted  notch  filter) 

B.  Analysis 

abk  (closed-loop  eigenvalues) 
svdsys  (multivariable  margins) 
zeros  (transmission  zeros) 

C.  Ricatti  Equation  Solutions 
kalman  (Kalman-Bucy  filter) 

lqgain  (linear  quadratic  optimal  control  gain) 

HONEY-X's  command  language  is  set  up  to  provide  commands  for  each  of  the  programs  in 
the  library.  These  commands  take  the  following  generic  form: 

<command  identified  <argument  list> 

The  <command  identified  is  merely  the  name  of  one  of  the  programs  in  t.he  library 
(eg.  matini,  matplt,  kalman),  and  the  <argument  list>  is  a  string  of  characters 
which  identifies  data  files  to  be  used  by  the  program  and  various  arguments  needed  for 
its  execution.  A  few  examplesserve  to  illustrate  this  command  format: 

>matmpy  sys.b  sys.k  sys.bk 
>matsub  sys.a  sys.bk  sys.a 

>cisabd  sys  -1,2, 0.1  (25) 

>matplt  sys.g/Real  vs.  Frequency/lwlr  all 

The  character  >  in  each  of  these  command  lines  is  a  prompting  symbol  displayed  by  the 
system  executive  on  the  graphics  terminal  to  solicit  a  command.  The  command  identifier 
in  entered  immediately  after  the  prompt.  For  example,  "matmpy"  in  line  1  invokes  the 
matrix  multiply  program.  It  takes  the  matrix  stored  under  file  name  "sys.b", 
multiplies  by  the  matrix  stored  under  filename  "sys.k"  and  stores  the  results  under 
filename  "sys.bk".  The  string  "sys.a  sys.b  sys.bk"  constitutes  a  specific  list  of 
arguments  for  matmpy.  Similarly,  the  second  command  line  invokes  the  matrix 
subtraction  program,  matsub,  which  takes  the  matrix  stored  mode  filename  "sys.a", 
subracts  the  matrix  under  "sys.bk"  just  created  by  matmpy,  and  stores  the  result  under 
"sys.a".  Note  that  the  last  step  deliberately  overwrites  the  original  sys.a  file. 
Another  file  name  supplied  as  a  third  argument  would  leave  sys.a  intact. 


The  third  command  line  invokes  the  special  purpose  routine,  cisabd,  which  evaluates  a 
frequency  response  matrix  according  to  equations  (19)  -  (21).  The  argument  "sys" 
causes  this  program  to  use  A,  B,  C,  and  D  matrices  stored  under  filenames  "sys.a”, 
"sys.b",  "sys.c"  and  sys.d",  respectively,  and  to  store  the  resulting  frequency 
response  under  filename  "sys.g".  In  effect,  the  routine  represents  an  operation 
performed  on  a  system,  not  on  an  individual  data  file  (eg.  argument  "sys”  replaced  with 
"x"  would  cause  matrices  A,  B,  C,  D  stored  under  files  x.a,  x.b,  x.c,  x.d  to  be  used  to 
produce  a  frequency  response  stored  under  x.g).  The  additional  characters  in  the 
argument  string  of  cisabd  specify  the  desired  frequency  range  for  evaluation,  in  this 
case  10~1  to  10+2  rad/sec,  and  the  frequency  increment  between  points,  in  this  case 
100-1  rad/sec  or  ten  points  per  decade.  If  these  latter  three  arguments  are  not 
supplied  on  the  command  line,  cisabd  will  ask  for  them  explicitly  before  execution. 


The  final  command  line  in  our  example  invokes  the  routine  "matplt”  which  provides 
graphical  plotting  capability.  The  argument  list  identifies  that  the  matrix  to  be 
plotted  is  stored  under  filename  "sys.g",  specifies  that  the  plot  be  labeled  "Real  vs. 
Frequency",  and  that  the  x,  y  variables  be  "lw"  for  log  frequency,  and  "lr",  for  log 
real,  respectively.  Scaling  is  automatic  by  the  default  option  "all".  Matplt  will 
proceed  to  generate  the  desired  graph  on  the  terminal  face,  complete  with  axes  and 
labels,  and  the  user  may  then  save  this  plot  as  hardcopy  if  its  results  are 
satisfactory. 


The  file  name  requirements  and  argument  lists  of  other  HONEY-X  commands  are  defined  in 
a  similar  manner  and  are  fully  documented  in  Reference  (16). 


4.0  DESIGN  EXAMPLES 


The  effectiveness  of  HONEY-X  and  the  LQG  design  process  in  Section  2  is  best 
illustrated  with  some  representative  design  examples.  The  examples  selected  here  are 
taken  from  Reference  (181 .  They  are  concerned  with  the  synthesis  of  flight  control 
laws  for  a  highly  maneuverable  aircraft.  The  vehicle  is  a  scaled  version  of  an 
advanced  fighter  presently  being  flight-tested  on  NASA  Dryden's  Remotely  Augmented 
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Vehicle  (RAV)  facility  [19].  While  the  present  emphasis  is  on  validating  aerodynamics, 
control  is  an  important  aspect  of  these  flight  tests  because  the  open  loop  vehicle  is 
highly  unstable.  The  vehicle's  multiple,  independently  controlled  surface  offer  the 
potential  for  high  maneuverability  in  future  experiments  and  an  excellent  test  bed  for 
multivariable  design. 

The  present  baseline  set  of  control  laws  for  the  vehicle's  pitch  axis  includes  a 
single-input  single-output  (SISO)  command-augmentation  system  (CAS)  combined  with 
automatic  limiting  of  load  factor  and  angle-of-attack.  These  control  laws  were 
designed  using  classical  loop-shaping  considerations  [20].  The  control  system  designs 
presented  here  represent  an  attempt  to  achieve  the  same  basic  pitch  axis  performance 
with  the  LQG  method  of  Section  2  and  to  extend  these  designs  to  multivariable  decoupled 
flight  path  control  (DFPC)  functions. 

The  Design  Model 

The  designs  will  be  based  on  a  linear  model  describing  motion  in  the  vertical  plane. 

The  flight  condition  corresponds  to  a  Mach  number  of  0.9  at  25,000  ft.  altitude. 
Linearized  (small  perturbation)  models  for  this  as  well  as  other  flight  conditions  are 
given  in  (20].  For  design  purposes  the  longitudinal  dynamics  are  uncoupled  from  the 
lateral-directional  dynamics.  The  state  vector  consists  of  the  vehicle's  basic  rigid 
body  variables 

8V  forward  velocity 

x=  a  angle-of-attack  (26) 

q  pitch  rate 

9  pitch  attitude 

As  usual,  flight  path  angle  (Y )  is  defined  as  Y  =  0-a.  The  control  surfaces  are 

the  elevator,  elevon  and  canard,  each  commanded  through  70  rad/sec  hydraulic  actuation 

systems . 

Model  Uncertainties 


The  above  model  neglects  higher  order  structural  modes  and  resonances  and  actuator 
dynamics  beyond  70  rad/sec.  Furthermore,  because  of  time  delays  in  the  RAV  data  links, 
the  actual  transfer  function  differs  substantially  from  the  nominal  model  at  high 
frequencies.  These  uncertainties  define  the  tm(u>)  bound  described  in  Section  2. 
Conservatively,  this  bound  exceeds  unity  for  all  id  greater  than  about  10  rad/sec.  To 
minimize  command-following  errors,  disturbance  responses  and  closed  loop  sensitivity  to 
low  frequency  plant  variations  it  is,  of  course,  desirable  to  maximize  loop  gains  below 
this  frequency  subject  to  the  limitations  imposed  by  uncertainty. 

The  next  section  considers  the  design  of  three  separate  control  loops.  The  first  is  a 
minimal  pitch  CAS.  This  is  followed  by  an  angle-of-attack  limiter,  and  finally  by  an 
advanced  multivariable  design  for  decoupled  flight  path  control. 

A  Minimal  CAS 


The  function  of  the  CAS  is  to  stabilize  the  vehicle  and  allow  it  to  be  flown  by 
commands  from  the  ground-based  RAV  facility.  The  commanded  variable  is  pitch  rate  and 
the  control  input  is  a  slaved  elevator/elevon  combination.  This  minimizes  sensing  and 
actuation  requirements.  To  realize  this  design  with  the  LOG  procedure  described  in 
Section  2,  either  variant  of  the  procedure  may  be  used,  i.e.,  KBF  design  with  LQR  loop 
transfer  recovery  for  point  (i)  .n  Figure  2  or  LQR  design  with  KBF  loop  transfer 
recovery  for  point  (ii).  This  is  because  SISO  loop  transfer  functions  are  identical  at 
both  points.  We  choose  the  latter  option  on  historical  precedent. 

The  first  step  of  the  design,  then,  is  to  choose  LQR  parameters  for  the  weighting 
matrices  Q’H'Th  and  R  =  pi  (the  duals  of  equation  (10)).  A  first  intuitive  choice  for 
H  is  (0010),  which  corresponds  to  weighting  the  commanded  variable  pitch  rate.  In 
light  of  the  dual  of  equation  (11),  namely 

o(T(ju)]  t  o[H|juI-A|-U]/^  (27) 

for  all  id  such  that  o(T]  >>  1, 

the  quality  of  this  choice  can  be  examined  without  actually  computing  the  LQG 
regulator.  Rather,  we  examine  the  singular  value  plots  of  the  transfer  function 
H  ( jioI-A)  "1b.  This  function  can  be  readily  constructed  with  HONEV-X  commands.  We 
simply  type 

>addname  Al.a  x.a 

>addname  Al.b  x.b 

>addname  Al.h  x.c  (28) 

>cisab  x  -3,  2,0. 1 

>matsvd  x.g  1 

>matplt  x .g . sigma/Al :  H(sI-A)B/  lwlr 

These  commands  assume  that  we  have  previously  defined  the  files  Al.a,  Al.b  and  Al.c  for 
the  aircraft  dynamics  and  Al.h  for  our  first  choice  of  weights.  The  programs  "matini" 
and  "matchg"  are  available  for  this  purpose.  The  three  "addname"  commands  then 
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associate  a  dummy  system  name  "x"  with  a  subset  of  these  files.  That  is,  the 
{A ,  B, C)  triple  of  system  "x"  is  equivalent  to  the  <A,B,H)  triple  of  system 
"Al".  The  program  "cisab'1  which  expects  files  in  the  standard  {A,B,C)  convention 
can  now  operate  on  system  ”x"  to  produce  the  desired  transfer  function  file  x.g.  Next, 
the  program  "matsvd"  performs  a  singular  value  decomposition  of  x.g  (an  absolute  value 
operation  for  the  current  SISO  case) ,  and  "matplt"  displays  the  resulting  singular 
value  file,  x.g.sigma,  as  a  function  of  log  frequency.  The  resulting  plot  is  shown  in 
Figure  4A. 

According  to  equation  (27),  the  shape  of  Figure  4A  will  be  the  shape  of  the  LOR  loop 
whenever  its  magnitude  is  large.  It  therefore  follows  that  our  first  weighting  choice 
would  produce  insufficient  low  frequency  gain  for  a  bandwidth  constraint  of  10 
rad/sec.  An  obvious  way  to  overcome  this  limitation  is  to  weight  the  pitch  attitude 
response  (the  integral  of  pitch  rate)  instead.  This  makes  H=(0001).  We  can  examine 
this  alternative  by  modifying  the  file  Al.h  with  "matchg"  and  then  retyping  the  command 
list  (28).  On  the  other  hand,  because  (28)  will  be  used  frequently,  it  can  be  saved  as 
a  macro  command.  For  illustration,  we  will  call  this  macro  "lqg_weight" ,  and  we  will 
set  the  macro  up  to  treat  the  string  "Al”  in  (28)  as  an  argument.*  The  single  command 

>lqg_weight  Al  (29) 

will  then  generate  and  display  the  new  singular  value  plot  shown  in  Figure  4B.  While 
this  alternative  clearly  shows  improved  low  frequency  properties,  its  gain  at 
frequencies  below  0.10  rad/sec  is  still  very  small.  This  can  be  remedied  by 
introducing  still  more  amplification  into  the  weighted  response  at  low  frequencies. 

For  example,  we  can  define  a  new  system  ”A2"  by  appending  an  added  state  to  "Al"  of  the 
form 


x5(s)  =  1  8 ( s) .  (30) 

S+0.02 

The  weighting  S'  =  9+0.14  xj  then  gives 


S' (s)  =  s+0.16  0 ( s )  (31) 

s+O . 02 

with  H  =  (000  1.0  0.14).  The  command  "lqg_weight  A2"  now  generates  Figure  4C.  This 
weight  selection  exhibits  very  desirable  loop-shape  properties. 

TABLE  2.  MACRO  lqr_weight 


&command_l ine  off 
addname  &l.a  x.a 
addname  sl.b  x.b 
addname  &l.h  x.c 
ec  cisab  x  &2 
ec  matsvd  x.g  1 

ec  matplt  x .g , sigma/&l :  H(sI-A)B/  lwlr  &3 
dn  x.a  x.b  x.c 
delete  x.g  x.g.sigma 

Once  the  weight  selection  is  complete,  it  remains  to  design  the  LQG  regulator  and  to 
recover  its  full-state  loop  transfer  function  with  a  properly  designed  Kalman-Bucy 
filter.  This  can  also  be  done  very  conveniently  with  HONEY-X  commands.  The  regulator 
design  and  singular  value  plot,  for  example,  can  be  accomplished  with  a  user  macro 

>lqt  A2  9.0  (32) 

The  HONEY-X  commands  which  make  up  this  macro  are  defined  in  Table  3.  The  arguments 
specify  that  system  "A2"  be  used  for  the  design  and  display  with  a  p  value  of  9.0. 

From  equation  (27)  and  Figure  4C,  this  value  should  produce  crossover  between  5-10 
rad/sec.  Figure  5A  shows  the  resulting  full  state  loop  transfer  function.  Note  that 
the  low  frequency  gain  duplicates  the  shape  of  Figure  4C  and  that  10  rad/sec  bandwidth 
is  cohsistent  with  our  stability  robustness  requirements. 

A  sequence  of  KBF's  with  measurement  0'  is  next  used  to  recover  the  above  loop 
transfer  at  point  (ii).  The  filter  design  parameters  are  the  duals  of  equation  (8), 
namely 

M  =  qBBT  q  =r-  ” 

N  =  ul  U  >  0  fixed  (33) 


*In  Multics  Command  language,  "Al"  is  replaced  by  "&1"  and  the  entire  list  (28)  is 
saved  as  an  executive  command:  lqg_weight.ec.  In  order  to  avoid  later  name  conflicts 
from  repeated  executions,  the  exec  com  should  also  delete  names  x.a,  x.b,  x.c  and  files 
x.g,  x.g.sigma  as  part  of  each  exeuction.  The  resulting  macro  is  shown  in  Table  2.) 
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Figure  4.  Weighting  Selection  for  Pitch  CAS 


TABLE  3.  MACRO  lqr 


4command_line  off 

ec  matsid  4l.r  (cdim  4l.bl,42 

ec  dlqgain  41 

addname  4l.a  x.a 

addname  4l.b  x.b 

addname  4l.k  x.c 

ec  cisab  x  43 

ec  matsvd  x.g  1 

ec  matplt  x . g . sigma/4l :  Kc(sI-A)B  Rho  =  42/  lwlr  44 
dn  x.a  x.b  x.c 
delete  x.g  x.g. sigma 

Again,  a  single  user  macro  accomplishes  these  operations: 

>lqr_recovery  A2  1.0  1.0  (34 

The  commands  in  this  macro  are  given  in  Table  4.  The  arguments  here  specify  that 
system  "A2"  be  used  to  design  tne  KBF  with  u=1.0  and  q*1.0.  The  macro  also  displays 
the  resulting  LQG  loop  transfer  function: 

=  (Kc(sI-A+BKc+KfC) -iKf )  C(Sl-A)-lB 


KG(s) 


(35 


Macro  " lqr-recovery '  was  invoked  several  times  with  sequentially  larger  values  of  q. 
The  results  are  shown  in  Figure  5B.  Note  that  the  LQR  function,  KG(s),  does  indeed 
approach  the  full  state  design  and  that  the  value  q=10^  produces  adequate  recovery 
over  the  required  frequency  range.  Hence,  Figure  4B  represents  an  adequate  minimial 
CAS  design. 


TABLE  4.  MACRO  lqr_recovery 


4command_line  off 

ec  matsid  il.n  [rdim  4l.c|,  42 

ec  matscl  5.1. b  41. gamma  43 

ec  dkalman  41 

ec  matmpy  41. kf  4l.c  x.t 

ec  matsub  41. a  x.t  x.a 

ec  matmpy  4l.b  41. k  x.t 

ec  matsub  x.a  x.t  x.a 

addname  4l.kf  x.b 

addname  41. k  x.c 

ec  cisab  x  44 

ec  matmpy  x.g  41. g  41. kg 

ec  matsvd  41.  kg  1 

ec  matplt  4l . kg . sigma/4l :  Kls}G{s)  Mu=42  q=63**2/lwlr  45 

dn  x.b  x.c 

delete  x.g  x.a  x.t 


A.  Full  State  Loop  Transfer 


B.  Loop  Transfer  Recovery 


A 2 :  Kc<il-0>9  Pho-  9. 


<09 


*r€99«t»C9 


A2:  «<«>C(«>  Hu>t  I00«t2 


Figure  5.  CAS  Design  Sequence 


An  Anqle-of -Attack  Limiter 

For  highly  maneuverable  aircraft,  the  basic  CAS  designed  above  must  usually  be 
augmented  with  "boundary  limiting  controllers"  for  angle-of -attack  and/or  normal 
acceleration.  Angle-of-attack  limits  are  imposed  to  avoid  stall,  buffet,  extreme  drag 
increases  and  loss  of  control.  Normal  acceleration  limits  are  based  on  structural 
considerations.  Boundary  limiting  controllers  are  designed  as  regulators  to 
independently  control  either  angle-of-attack  or  normal  acceleration.  Transitions 
between  these  limiters  and  the  normal  CAS  involve  switching  logic  which  is  discussed  in 
[201  • 

The  design  of  boundary  limiters  can  be  achieved  with  the  same  LQG  used  to  design  the 
pitch  CAS  in  the  previous  section.  Weighting  the  integral  of  angle-of-attack,  for 
example,  and  adjusting  the  parameter  p  results  in  the  LQR  loop  gain  shown  in  Figure 
6A.  This  satisfies  our  performance  objectives  and  is  consistent  with  bandwidth 
limitations.  Using  integrated  angle-of-attack  as  the  sole  measurement,  however,  the 
sequence  of  KBF's  in  Figure  6  fails  to  recover  the  full  state  loop  transfer  function. 
Subsequent  evaluation  of  the  a/5  transfer  function’s  zeros  with  the  HONEY-X  command 

>zeros  A3  (36) 

revealed  the  presence  of  a  non-minimum  phase  zero  at 

s  »  +0.195  rad/sec  (37) 

This  zero  violates  the  minimum-phase  assumption  in  Section  2.  The  results  here  show 
thatthis  assumption  is  not  a  technical  one.  Rather,  non-minimum  phaseness  imposes 
fundamental  design  performance  constraints. 

The  loop  properties  can  be  recovered,  however,  by  adding  other  measurements  to  the  KBF 
to  obtain  a  minimum  phase  plant.  Adding  a  pitch  rate  measurement,  in  this  case,  is 
sufficient.  The  LQR  loop  gain,  T(s),  and  the  function  KG(s)  for  several  KBF  iteration 
usinq  the  two  measurements  is  shown  in  Figure  6C.  Note  that  full  state  transfer 
function  recovery  has  been  restored. 
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Loop  Transfer  Recovery  with 
Measurement 
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C.  Loop  Transfer  Recovery  with  q,  a  Measurement 

Figure  6.  Angle-of -Attack  Limiter  Design  Sequence 

Decoupled  Flight  Path  Control  (DFPC) 

This  section  discusses  our  final  example,  an  advanced  multi-input/multi-output  control 
law  for  DFPC.  Design  of  these  flight  modes  is  a  subject  of  continuing  research 
interest . 


DFPC  Objectives: 

with  the  advent  of  digital  flight  control  and  the  availability  of  multiple  surfaces  on 
new  fighters  there  is  interest  in  designing  control  modes  for  precision  flight  path 
control.  These  modes  have  the  objective  of  decoupling  attitude  from  flight  path 
motions.  They  offer  precise  control  for  certain  tasks  as  well  as  the  possibility  of 
new  tactics  for  the  advanced  fighters.  Three  common  decoupled  flight  path  control 
objectives  are  [211  : 

Vertical  Translation 

Vertical  velocity  control  at  constant  6  (a  varies).  Attitude  remains 
constant  as  the  velocity  vector  rotates. 

Pitch  Pointing 

Attitude  Control  at  constant  flight  path  angle  (i.e.,  0  *  a)  .  Note  that 
the  velocity  vector  does  not  rotate.  The  angle  of  attack  is  varied  at  a 
constant  normal  acceleration. 

Direct  Lift 


Flight  path  control  at  constant  angle  of  attack  (i.e.,  Y  =  0.  This  mode 
produces  a  normal  acceleration  response  without  changing  angle-of -at tack . 

All  three  of  these  modes  are  illustrated  in  Figure  7.  They  can  all  be  achieved  by  a 
single  multivariable  controller  which  provides  independent  regulation  of  pitch  attitude 
and  ang le-of -attack .  For  example,  vertical  translation  can  be  accomplished  by 
commanding  ana-profile  while  holding  R  constant.  Similarly,  pitch  pointing  can  be 
accompl  ished  by  simultaneously  commanding  a  and  O,  and  direct  lift  by  commanding 
9  with  a  constant. 
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Figure  7.  DFPC  Modes 

As  discussed  above,  the  basic  objective  of  a  decoupled  flight  path  control  loop  is  to 
provide  independent  control  of  angle-of-attack  and  pitch  attitude  with  approximatly 
equivalent  speeds  of  response.  The  controls  available  for  this  function  consist  of  the 
standard  elevator  (8e)  and  a  forward  mounted  canard  (8C) .  As  before,  uncertainties 
in  the  actuators,  structural  resonances  and  time  delays  in  the  RAV  facility  restrict 
the  loop  bandwidth  to  approximately  10  rad/sec. 

Because  loop  transfer  properties  are  desired  specifically  at  the  measured  outputs, 
(i.e.,  point  (i)  in  Figure  2)  the  LQG-based  design  procedure  begins  by  designing  a  KBF 
with  desired  loop-transfer  properties.  Letting  the  first  candidate  parameters  for  this 
design  be 

M  =  rrT  with  r=B  (38) 

N  =  ul  with  u=l 

and  appending  integrators  to  0  and  a  at  the  outset  (based  on  our  experience  above), 
the  user  macro 

>kbf_weight  A4  (39) 

gives  the  a-plots  for  C(sl^-A)_1r  shown  in  Figure  8A.  These  plots  exhibit  wide 
separation  between  £  and  a.  As  noted  in  Section  2,  the  KBF  loop  transfer  function 
will  closely  approximate  these  o-values  whenever  n>>l.  Hence,  we  must  expect 
dramatically  different  gains  and  speeds  of  response  in  two  loops  of  the  system. 

Singular  value  separations  can  be  alleviated  by  "balancing"  the  noise  input  matrix. 

For  example,  with 

r=B(Vf_1)  (40) 

where  V  is  the  right  singular  vector  matrix  of  G(j(0j)  and  Z  is  the  corresponding 
matrix  of  singular  values,  the  two  new  singular  values  will  become  balanced  (identical) 
at  hi  -  Wj.  This  follows  from  (23).  For  uij  =  10^*5,  the  following  HONEY-X 
commands  calculate  V  and  Z: 

>addname  A4.a  x.a 
>addname  A4.b  x.b 

>addname  A. 4. gamma  x.c  (41) 

>c  isab  x  0 . 5 , o . 5 , 1  . 

>matsvd  x.g  1.3 

>matprt  x.g . sigmao. 5 , 0 . 5 , l . 

>matprt  x.g.v  0.5,0. 5,1 
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The  results  are 


V 


.962 

-.274 
+j. 00046 


-.274 

-.962 
+ j . 0016 


and  £ 


diag  (1.8  .0077) 


(42) 


While  V  is  generally  complex  and  not  rational  as  a  function  of  <a,  note  that  its  value 
at  uj  »  100*  5  is  nearly  real.  Using  only  the  real  part  in  (40),  therefore,  gives  a 
good  balancing  approximation.  This  is  verified  in  Figure  8B  which  shows  the 
corresponding  new  a-  plots. 


The  full-state  KBF  loop  transfer  function,  T(s)  «  C(sI-A)_1Kf,  corresponding  to  the 
weights  in  Figure  8B  is  shown  in  Figure  9,  together  with  the  loop  transfer  GK(s)  for 
several  steps  of  the  subsequent  LQR  design  sequence.  Following  Section  2,  these  steps 
use  the  design  parameters 


Q  =  qCTC  q  =£•  ” 

R  =  p  I  p  f ixed  (43) 


The  convergence  of  GK(s)  toward  T(s)  is  clearly  evident.  It  appears  that  the  design  at 
q  =  10®  is  an  adequate  approximation  of  T.  Sigma-plots  of  its  total  return 
difference  function,  I+GK,  and  its  stability  robustness  function  GK(I+GK)~1  are  shown 
in  Figure  10.  These  verify  that  design  objectives  (1)  and  (3)  are  indeed  satisfied  for 
the  multivariable  DFPC  system. 
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Figure  8.  Weighting  Selection  for  UFPC 
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Figure  9.  DFPC  Design  Sequence 
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Figure  10.  Close  Loop  DFPC  Properties 


IV.  CONCLUSIONS 

This  paper  has  illustrated  the  use  of  the  LQG  design  methodology  for 
frequency-domain-oriented  multivariable  feedback  design.  The  basic  frequency  domain 
properties  of  LQG  were  reviewed,  key  algorithms  needed  for  its  implementation  were 
discussed,  and  an  interactive  computer-aided  design  setup  was  introduced  to  carry  out 
LQG  designs  effectively.  These  elements  were  illustrated  with  several  fight  control 
designs  for  highly  maneuverable  aircraft.  While  many  additional  design  efforts  need  to 
be  carried  out  before  final  judgements  should  be  passed,  the  LQG  procedure  and  its 
interactive  computer-aided  implementation  appears  to  offer  a  very  powerful  and 
efficient  approach  to  modern  multivariable  feedback  design. 
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SUMMARY 

This  paper  explains  the  notions  of  (almost)  controlled  and  conditionally  invariant 
subspaces  of  linear  systems  and  outlines  their  application  to  the  synthesis  of  feedback 
compensators  for  disturbance  rejection  and  to  robust  controller  design. 

INTRODUCTION 

In  this  lecture  we  will  present  a  general  introduction  to  the  disturbance  rejection 
and  the  robust  controller  design  problems  which  we  will  view  as  a  procedure  for  designing 
a  high  integrity  feedback  control  system.  The  general  problem  set-up  is  the  following.  The 
model  which  will  be  used  for  the  plant  is  shown  in  Figure  1.  The  plant  thus  accepts  two 
types  of  inputs : 'unwanted'  disturbances  d  and  controls  u  which  the  designer  can  choose, 
and  it  produces  two  types  of  outputs:  the  measured  outputs  y  and  the  to-be-controlled  out¬ 
puts  z.  This  model  suggests  that  the  disturbances  are  externally  generated  signals.  There 
are  of  course  many  situations  in  which  in  addition  to  external  inputs,  the  disturbances 
also  consist  of  parameter  variations,  uncertain  non 1 i near i t i e s ,  and  internal  noise  in  the 
system.  We  will  lump  all  these  perturbations  which  act  on  the  nominal  plant  into  one  uncer¬ 
tain  feedback  loop  which  yields  the  situation  shown  in  Figure  2. 
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FIGURE  I 


FIGURE  II 


In  control  system  synthesis  we  are  asked  to  design  a  feedback  compensator  which  uses 
the  measurement  output  in  order  to  select  the  control  input  (see  Figure  3)  according  to 
some  design  requirement  which  seeks  to  actieve  desirable  properties  for  the  closed  loop 
system.  This  closed  loop  system  is  shown  in  Figure  4(a)  for  the  case  of  external  disturb¬ 
ances  and  in  4(b)  for  the  case  that  there  is  also  an  uncertain  loop  in  the  system. 
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In  this  lecture  we  will  concentrate  on  what  is  perhaps  the  most  natural  question  to 
ask  in  this  context,  namely  the  problem  of  disturbance  rejection .  The  design  philosophy 
is  then  to  choose  the  feedback  compensator  in  such  a  way  that  in  the  closed  loop  sys ter. 
(see  Fig.  4(a))  the  to-be-controlled  output  z  is  independent  of  the  disturbance  d.  This 
design  criterion  is  called  exact  disturbance  rejection.  We  will  also  investigate  the  ques¬ 
tion  of  when  the  influence  of  d  on  z  can  be  made  arbitrarily  small  in  some  precise  sense 
( almost  disturbance  rejection ) .  We  will  return  to  the  implications  of  this  synthesis  ap¬ 
proach  to  the  design  of  robust  controllers  i.e.  feedback  compensators  which  protect  the 
closed  loop  system  behavior  against  uncertain  loops  in  t'he  system  (see  Fig.  4(b)). 

This  paper  contains  only  the  outline  of  the  theory.  For  proofs  and  more  details  we 
refer  to  the  literature.  A  few  words  about  notation;  IR  denotes  the  real  line,  t  the  com¬ 
plex  plane,  IR  +  :  *  [0,«>)  ,  IR“  :  =(-«>,  0]  ,  and  for  n  a  positive  integer  in:  »  (l,2,...,n}. 

The  spectrum  (i.e.,  the  eigenvalues  counting  multiplicities)  of  the  square  matrix  M  will 
be  denoted  by  o(M).  We  will  often  consider  families  of  subspaces  of  a  given  finite  dimen¬ 
sional  vector  space  X.  Let  I*  be  such  a  family;  we  will  say  that  Ij  is  closed  under  addition 
if  L i  ,  Lj  €  L  Lj+L2  €  L  and  closed  under  intersection  if  Lj  ,  Lj  ^  i  #  LjflLj  f  L.  The  sub- 
space  sup  L  denotes  the  smallest  subspace  of  X  which  contains  every  element  of  L  while 
inf  Ij  denotes  the  largest  subspace  of  X  contained  in  every  element  of  L.  In  general  sup  L 
and  inf  L  donot  belong  to  L.  However,  the  following  (trivial)  case  yields  a  result  which 
is  very  important  to  us: 

Lemma  0 :  If  L  is  closed  under  addition  then  sup  L€L  and  if  L  is  closed  under  inter¬ 
section  then  inf  L€  L. 


Let  A :  X~*X  be  a  linear  operator  on  the  n-dimensional  vector  space  X  and  LcX  such  that 
ALCTL,then  we  say  that  L  is  A-invariant.  We  will  define  A|L  and  A(mod  L)  by  the  commutative 
diagram : 


p  i 

X ( mod  L)  • 


l P:i 

►  X  (  mod  L) 


canoni cal 
injection 
canon i ca 1 
projection 


If  we  choose  a  basis  in  X  such  that  X-X^e  X2 
with  Xj«L  and  ^x  (mod  L)  then  in  this  partition 

A  looks  like  A  ^2]  *nd  A11  “  A|L' 

A22  =  A  *mod  L*  * 


Let  L 1 , L2  €  X .  Then  we  will  say  that  Lj  is  A (mod  L2 ) -invar iant  if  AL[ c  Lj  +  L2  and  that 
Lj  is  Al  L2  invariant  if  A  (  L 1  H  L2  )  cr  L  j  .  Furthermore  <A|Li>:  =  Li +ALj  +  .  . . +An“ 1  Li  and 
<Lj|A>  :  =  Lj  fl  A“lLj  fl  ...nA‘“n+1L1  denote  respectively  the  smallest  A-invariant  subspace 
containing  L j  and  the  largest  A-invariant  subspace  contained  in  Lj .  In  terms  of  the  sys¬ 
tem  x  =  Ax  +  Bu;  y  =  Cx,  R :  =  <Al LmB>  is  the  reachable  subspace,  while  N :  <kerC)A>  is 
the  non-observable  subspace  (im  and  ker  denote  image  (range)  and  kernel  (nullspace) 
respectively) . 


Let  £:  T  -*  X  with  T  =  3R  ,3R  or  more  generally  an  arbitrary  interval  in  IR,  and  X  a  norm- 
ed  finite  dimensional  vector  spage.  Then  we  will  say  that  f €  L  with  1  <  p  <®  i f 

-  P 

Hfj|  =  1  (  Jpllfj  t  )ll  Pdt)  1//p<oo  for  l<p  <«> 

p  I  ess  supllf(t)ll  <  00  for  p  =  °° 

'  t  ” 


The  notation  a.c.  stands  for  ’absolutely  continuous’  and  a.e.  stands  for  ’almost  every¬ 
where’  and  is  used  here  exclusively  in  connection  with  Lebesgue  measure. 


MATHEMATICAL  MODEL 

We  will  consider  the  linear  time -i nvar iant  finite  dimensional  plant: 

:  x  =  Ax  +  Bu  +  Gd  ;  y  **  Cx,  z  =  Hx 

with  x€X  =  IRn,  u  €  U  =  IR™ ,  d€D  =  IR*^ ,  y  €  Y  =  IRp,  and  z€Z  =  IR^  .  The  feedback  compensator 
If  will  be  taken  to  belong  to  the  same  category,  i.e.,  be  linear  t i me - i nva r i a n t  and  finite 
dimensional : 

I  :  w  =  KW  +  Ly  ;  u  =  Mw  +  Fy 


The  closed  loop  system  has  d  as  exogenous  input  and  z  as  output  and  is  described  by: 

I  :  X®  =  Aexe  +  Ged  ;  z  *  H^6 
cl 


with  x  :  *  (x,w) ,  and  (A  ,G  ,H  )  matrices  obtainable  from  (A,B,G,C,H)  and  ( K , L , M , F )  in  a 
straightforward  manner: 


A  +  BFC 

BM 

e 

G 

e 

LC 

K 

( ;  = 

0 

H 

lH 

The  solution  z  to  is  given  by  the  variation  of  constants  formulc 


,  ,  e  A  t  p 

7.  (  t )  -  H  e  X  ( 0  ) 


r.,e  A  (t-T)  e 
J  h  e  C.  dll)  c|  t 
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and  shows  the  additive  influence  on  z  of  the 
d(«).  We  will  denote  the  closed  loop  impulse 

function  by  G  , .  Hence 
—cl 


W  :  IR  -*m 


■e«q 


w  (  t) 


initial  condition  >ce(0)  and  the  disturbance 
response  by  W  and  the  closed  loop  transfer 


t-<r 


G  .  (s  )  = 

—cl 


0  —  l  e 

(Is-A  )  G 


EXACT  DISTURBANCE  REJECTION 

In  this  context  the  exact  disturbance  rejection  problem  may  be  formulated  as  follows: 

( DDPM )  :  The  disturbance  decoupling  problem  by  measurement  feedback:  Given  lp,does  there 
exists  If  such  that  W  =  o?  If  so, f in d  algori th ms  for  computing  such  a  I  «  (k,L,m,F ) f rom 
I  «  (A, B ,G,C, H)  . 

P 

Motivated  in  part  by  the  above  problem,  there  have  been  some  important  new  develop¬ 
ments  in  the  area  of  linear  systems  since  about  1970.  These  center  around  a  number  of  new 
concepts.  First  controlled  invariant  and  controllability  subspaces  (which  are  the  key  con¬ 
cepts  in  the  book  of  Wonham  [l]  who  did  much  of  the  original  work  ir»  this  area)  ,  further 
the  conditionally  invariant  and  complementary  observability  subspaces  (see  [2]  and  for  a 
recent  fully  developed  treatment  [ 3 } )  and  finally,  the  'almost'  versions  of  these  concepts 
[4,5].  These  ideas  still  play  a  very  important  role  in  the  research  in  system  theory  and 
numerous  relevant  papers  keep  appearing. 

Consider  the  linear  system 


I  :  x  *  Ax  +  Bu 


We  will  denote  by  IX(A.B)  all  possible  state  trajectories  generated  by  I.  Formally, 
IX(A,B):  =  {x:  IR  -*  X 1  £  is  a.c.  and  3u:  IR -»  U  such  that  x(t)  =  Ax(t)  +  Bu(t)  a.e.}.  Equiva¬ 
lently,  £  x  ( A  ,  B )  =  { x  :  3R  -*  X  I  x  is  a.c.  and  x(t)  -  Ax  ( t)  €  ira  B  a.e.}. If  there  is  no  chance 
for  confusion  we  will  denote  I  (A,B)  by  Ix. 

Definition  1:  A  subspace  V c  x  is  said  to  be  a  controlled  invariant  subspace  if 
VxQ  €  V ,  3x€Ix  such  that  *  (0 )  =  xo  and  x(t)  €  V,Vt.  (see  Fig.  5) 


FIGURE  V: CONTROLLED 
INVARIANCE 


FIGURE  VI 


Let  V  denote  the  set  of  all  invariant  subspaces,  and  V(K)  those  contained  in  a  given 
subspace  K  of  X.  The  following  property,  essentially  trivial,  is  crucial  in  applications. 

p r opos i t ion  1 :  v  is  closed  under  subspace  addition  (i.e. ,Vi,V2  6  Vj  +  V2 €  V)  . 
Consequently ,  supV(K)  =  :VJ6V(K). 

Equivalent  properties  for  controlled  invariance  are: 

(i)  V€  V  (holdability  by  open  loop  control  laws) 

(ii)  3F  such  that  (A+BF)Vc  V  (holdability  by  feedback  controls:  if  we  use  the 
control  law  u=Fx  on  I  then  we  obtain  the  closed  loop  'flow'  x  =(A+BF)x  and 
jc  ( 0 )  €  V  then  yields  x(t)  =  e(A  +  BF)tx(0)  C  V) 

(iii)  AV c  V  +  imB  (A(mod  imB) 'invariance  (see  Fig.  6)) 


(DDP) 

Let  us  now  consider  our  disturbance  rejection  problem  with  full  state  feedback,  i.e. 
the  problem  whether  there  exists  a  feedback  matrix  F  such  that  u  =  Fx  will  decouple  the 
disturbance  d  from  z  in  x  -  Ax  +  Bu  ♦  Gd ;  z  =  Hx.  This  problem  is  denoted  by  (DDP) .  Its 
solution  is: 


Theorem  1  : 


f (DDP) 


is  solvable)  *•  (imGcV*  „ } 
ker  H 


7-4 


The  synthesis  for  (DDP)  goes  as  follows: 

Data :  A  ,  B  ,  G  ,  H 

Feasibility  Computation:  Compute  and  verify  whether  imG  cz 

Feedback  Gain  Computation:  Compute  F  such  that  (A  +  BF)Vi{erH  c  vkerH 

Then  u  =  Fx  will  solve  (DDP) . 


(DDEP) 

In  order  to  get  observations  into  the  picture,  we  will  introduce  the  so-called  con¬ 
ditionally  invariant  subspaces.  Consider  therefore  the  linear  system 


Z*  :  x  =  Ax  ;  y  =  Cx 


Now  let  S  be  a  subspace  of  X  and  consider  the  trajectories  modulo  s.  In  other  words  if 
P  is  a  (surjective)  map  with  S  *  kerP  we  consider  the  output  w  =  x(modS)  =  Px. 

Definition  2:  s  is  called  a  conditionally  invariant  subspace  if  there  exist  K,L  such 
that  w  =  Kw  +  Ly . 

The  conditionally  invariant  subspaces  behave  completely  dually  to  controlled  in¬ 
variant  subspaces.  Let  £  denote  the  set  of  all  conditionally  invariant  subspaces,  and  S_  ( K ) 
those  containing  a  given  subspace  K  of  X.  The  following  property,  essentially  trivial,  is 
crucial  in  applications: 

Propos i tion  2 :  S  is  closed  under  subspace  intersection  (i.e.  Si,S2€S^SiH  S2€  S) . 
Consequently  inf  S(K)  =:  S*£S(K). 

This  S{£  plays  a  very  important  role  in  applications  and  starting  from  A,C  and  G,  there  ex¬ 
ist  effective  algorithms  for  computing  S*  _  (see  Appendix). 

lDu 

Equivalent  statements  for  conditional  invariance  are: 

(i)  S€S  (x  (modS)  is  tractable  from  y) 

(ii)  3L '  such  that  (A  +  L'CJScS  (S  can  be  made  invariant  by  output  injection) 

(iii)  A ( S  0  kerC)  c  S  (S  is  AlkerC  -  invariant  (see  Fig.  6)) 


Actually  from  (ii)  it  is  possible  to  compute  the  matrices  K  and  L  which  allow  us  to  track 
w  =  x ( mod S )  «  Px  from  y  via  w  *  Kw  +  Ly .  This  may  be  seen  from  the  commutative  diagram 


A  +  L’C 


L  X (modS ) . 


|p :  canonical 
X  (modS) 


projection 


In  terms  of  matrices  K  is  defined  by  KP  =  P(A  +  L'C)  and  L  =  PC. 

An  immediate  application  of  the  notion  of  conditionally  invariant  subspaces  is  to 
the  disturbance  decoupled  estimation  problem  (DDEP).  The  problem  (see  Fig.  7)  is  as  fol¬ 
lows:  given  the  plant  :  x  =  Ax  +  Gd  ;  y  *  Cx,  z  *  Hx  with  d  the  disturbance,  y  the  meas¬ 
ured  output,  and  z  the  to-be-estimated  output.  We  would  like  to  construct  an  observer 

:  w  =  Kw  +  Ly  ;  z  =  Mw  +  Fy ,  which  processes  the  measurements  in  order  to  produce  an  es¬ 
timate  &  of  z/such  that,  with  the  initial  conditions  x(0)  *  0,  and  w(0)  »  0,  the  estimation 
error  e  =  z  -  2  is  independent  of  d,  i.e.  such  that  the  transfer  function  d  e  is  zero. 


DISTURBANCE 

d 


TO-BE-ESTIMATED  OUTPUT 


FIGURE  VII 


Theorem  2  :  {(DDEP)  i 8  3 O  Ivab  le  }  —  { ft  kerC  c  kerH} 

The  synthesis  of  the  disturbance  decoupled  observer  goes  as  follows: 

Data :  A , G  , C , H 

Feasibility  computation:  Compute  S*mG  and  verify  whether  sfmGH  kerCc  kerH 

Observer  Gain  Computation:  Compute  L'  such  that  (A  +  L ‘ c ) s i mG  c s i mG ■  Let  K  and  L  bG 

as  defined  in  the  above  commutative  diagram:  KP  =  P(A  +  L’ 
and  L  =  PC  with  P  the  canonical  projection  X  "*  X  (mods  JmG)  . 
Let  M  and  F  be  such  that  H  *  MP  +  FC  (such  M,F  exist  be¬ 
cause  of  the  condition  of  Theorem  2)  .  The  system  w  *  Kw  + 
Ly  ;  i  =  Mw  +  Fy  provides  the  desired  observer. 


C) 
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(DDPM) 


We  now  turn  our  attention  to  the  disturbance  decoupling  problem  with  measurement 
feedback  as  introduced  in  the  beqinning  of  this  section.  The  main  ideas  behind  the  solu¬ 
tion  of  this  problem  are  as  follows.  Consider  the  following  generalization  of  both  A(mod  imB/ 
and  AlkerC  -  invariant  subspaces  for  the  system 


Defi n i t i on  3  : 


I :  x  *  A  x  +  B  u  ;  y  *  C  x 


A  subspace  L  of  X  is  called  (A,B,C)  -  invariant 


,y  L* 


Let  L  denote  the  family  of  all  such  subspaces.  It  is  easy  to  show  that 

1l€l}**{3F  such  that  (A  +  BFC)Ul}.  Unfortunately  L  is  neither  closed  under  subspace 
addition  nor  intersection  and  L  lacks  hence  many  of  the  nice  properties  of  V  or  S. 

Now,  if  we  had  an  L£  L  wedged  in  between  imG  and  kerH:  imGC  LC  ketH  then  (DDPM) 
would  be  soved.  Indeed,  construct  F  such  that  (A  +  BFCJLc  L  (linear  algebra)  and  use  the 
memory  less  feedback  compensator  u  =  Fy  on  51  p  -  Then  it  is  easy  to  see  that  the  closed  loop 
system  Z c 1 :  x  »  (A  +  BFC) x  +  Gd  ;  z  *  Hx  will  be  disturbance  decoupled,  since  <A  +  BFCfimG> 
c  kerH  obviously  implies  that  the  transfer  function  d  *-►  z  is  zero.  However,  at  the  present 
time  no  constructive  conditions  for  finding  suchan  l  are  known.  In  order  to  get  around 
this  problem,  we  introduce  the  notion  of  (A,B,C)  -  pairs  (first  introduced  by  Schumacher 

-  see  13]  for  many  other  applications  of  this  idea)  : 

Definition  4:  A  pair  (V,S)  with  V  €  V ,  S  £  S  and  VcS  will  be  called  an  ( A , B , C ) -pax r . 

( A , B , C )  -  pairs  come  in  very  handy  when  instead  of  considering  Z :  x  *  Ax  +  Bu  j  y  *  Cx, 
we  consider  an  extension  of  it:  Ie :  x  =  Ax  +  Bu,  w  =  ua  ;  y  =  Cx,  ya  =  w  and  consider  I 
with  state  space  xe  *  X©W,  (u,ua)  as  control  input  and  (y,ya)  as  observed  output.  Let 
P  be  the  projection  P  :  Xe -♦  X.  Then  it  is  easy  to  show  the  following  simple 

Proposition  3:  { Ve  €  Ve  }  «*  { P  ve  £  V  )  and  (se£  Se)»  iseH  *£  S) 

From  this  proposition  it  follows  imediately  that  every  Le  6  Le  generates  with  (PLe,  Le  (1  X) 
and  ( A , B , C ) -  pair.  The  nice  thing  however  is  that  every  (A,B,C)  -  pair  is  obtained  this 

way.  in  other  words  .given  an  (A,B,C)  -  pair  (V,S),  there  exists  an  extension  Ie  of  I  (the 

dimension  of  W  in  the  extension  need  never  be  larger  that  dimv  -  diraS)  and  an  Le  €  Le  such 
that  PLe  =  V3S  *  Le  H  X  .  In  order  to  see  this,  take  W  =  V(modS)  and  let  Z  c  V  ©  W  be  such 
that  dimZ  =  dimW  ,  Zfl  V  =  ZflW  =  {o},  and  PZ©  S  =  V.  Then  Lfi  =  Z  ©  S  satisfies  all  the  requir¬ 
ed  properties.  From  this  it  mav  be  seen  that  any  (A,B,C)  -  pair  (V,S)  wedged  in  between 

imG  and  kerH:  imG  c  sc  V  c:  kerH,  yields  a  solution  to  (DDPM)  as  follows:  let  Le  be  such  that 
( V , S )  =  (PLe,  Le 0  X) .  Consider  the  extended  plant  I®:  x  =  Ax  +  Bu  +  Gd ,  w  =  ua ;  y  =  Cx, 

ya  =  w,  z  =  Hx,  written  compactly  as  xe  =  Aexe  ♦  Beue  +  Ged;  ye  =  Cexe,  z  =  Hexe .  Obvious¬ 
ly  Le  is(Ae,Be,Ce)  -  invariant  and,  since  imG  =  imGecs  =  Le  0  X  c  Le  <=  PLe  ©  W  =  V®W<=kerH®W. 
i'-  is  wedged  in  between  imGeand  kerHe  .  Hence  there  exists  Fe  :  Ye  -*  Ue  such  that  ( Ae  +BeFeCe )  Le 
c  Le  .  This  feedback  control  will  disturbance  decouple  in  the  extended  system.  The  beau- 
ful  thing  however,  is  that  this  static  feedback  control  law  in  the  extended  plant,  is 
dynaiaic  feedback  in  the  non  -  extended  plant  I  .  Indeed,  let  Fe  :  Ya  ©  Y  -*  Ua  ©  U  be  partition¬ 
ed  as  Fe  =  |*  .  Then  writinq  out  the  feedback  law  ue  *  Feye  gives  us  the  dynamic  compen¬ 

sator  Zc  :  w  =  Kw  ♦  Ly ,  u  =  Mw  +  Fy  which  will  decouple  the  disturbance  d  from  the  controll¬ 
ed  output  z.  Formally  stated: 

Theorem  3:  {(DDPM)  is  solvable)  —  { S*  _  c:  V*  } 

-  imG  kerH 

Conceptualizing  this  into  an  algorithm  gives: 

Data :  A , B , G , C , H 

Feasibility  computation:  Compute  S*raG  and  vferiJ*  Verify  if  sImG c  v?erH * 

Compensator  Gain  Computation:  Take  dimW  *  diravf firH-dimS J mG •  Pick  l®  c  X  ®  W  such  that 

PLe  =  V{erH  and  L  (1  X  =  stmG*  Compute  Fe  such  that 

(A6  +  BeFeC6)  L6  c  L6 .  Then  Fe  =  yields  the  desired 

disturbance  rejecting  compensator. 


(DDPMS)  AND  (DDPMPP) 


In  the  disturbance 
the  problem  of  making  the 
Thus  we  know  when  and  how 

z  ( t)  = 


decoupling  problem  considered  up  to  now,  we  have  concentrated  on 
influence  of  the  disturbance  on  the  controlled  output  equal  to  zero, 
we  can  make  the  second  term  in  the  response 

HeeAGtxe(°)  +  /  HeeMt“T)Ge  d(i)  dx 
0 


eaual  to  zero.  Of  course,  in  applications  we  will  also  have  to  worry  about  the  transient  res- 
e  Ae  t  e 

oonse  He  x  (0).  It  is  natural  to  do  this  by  Imposing  some  internal  stability  requirements 
on  the  closed  loop  system.  A  qeneral  formulation  for  this  is  to  introduce  t  c  <t  (the  'good' 
part  of  the  complex  plane:  C  is  any  non-empty  subset  of  t  with  at  least  onl  point  on  the 
real  axis.  If  we  require  simple  asymptotic  stability,  we  would  have  C  .  (s  £  t  I  Rets)  <  0  }  )  , 
and  require  that  the  spectrum  of  Ae,  o(Ae),  be  in  t  .  This  yields:  9 


(DDPMS) :  The  disturbance  decoupling  problem  with  measurement  feedback  and  stability 


Given  Ip  and  tg ,does  there  exist  Xc  suck  that 
rithms  for  computing  Xc  from  Xp  and  ty. 


0  and  o(Ae)  c  L?  If  so,  find  algo- 


If  we  require  (DDPMS)  to  be  solvable  for  all  C  , 
turbance  decoupling  problem  with  measurement  feedback 


then  we  speak  of  ( DDPMPP ) :  The  dis- 
and  pole  placement. 


Unfortunately,  we  cannot  enter  into  details  about  these  problems.  They  are  treated  for 
instance  in  [6],  where  other  relevant  references  may  be  found.  We  simply  state  the  result: 


Theorem  4:  {(DDPMS)  is  solvable)  "  {S*  .  _  t=  V*  ,  „}  and  {(DDPMPP)  is  solvable) 

-  f  ,  g,imG  q , kerH 

imG  kerH 


here,  V*  (resp.  RxerH)  denotes  the  supremal  stabilizable  (relative  C  )  controlled  in- 

variant^ t reso.  cont r ol 1 ab i 1 ty )  subsoace  contained  in  kerH,  while  (?esp.  N*mG^  denotes 

the  infimal  complementary  detectability  (relative  tg)  (resp.  observability)  subspace  con¬ 
taining  imG.  These  subspaces  are  computable  by  means  of  finite  algorithms  (see  Appendix) 
and  once  the  conditions  are  verified,  the  computation  of  the  compensator  gains  may  be  carri  - 
ed  out  with  the  aid  of  a  pole  placement  routine . 


ALMOST  CONTROLLED  INVARIANT  SUBSPACES 

We  consider  again  the  system  x  *  Ax  +  Bu.  As  we  have  seen,  a  subspace  is  said  to  be 
a  controlled  invariant  subspace  if  starting  in  it,  we  can  stay  in  it  by  choosinq  the  con* 
properly.  If  we  only  require  to  remain  arbitrarily  close  to  the  subspace,  then  we  arrive 
at  an  almost  controlled  invariant  subspace  (see  Fig.  8) .  Its  formal  definition  is: 

Definition  5:  A  subspace  va c x  is  said  to  be  an  almost  ( controlled )  invariant  sub- 
space  if  Vxq  €  Va  and  £  >  0,  3x€  £x  such  that  x(0)  -  xq  and  d(x(t)  ,Va) 

<e,  Vt,  where  d  (x  ( t)  ,  V  ):  =  inf  veVall  v-x  ( t )  II.  A  subspace  Ra  c  X  is  said 
to  be  an  almost  controllability  subspace  if  VxQ,XjERa,  3  T>  0  such 
that  V  £  >0  ,  3x€  Ix  with  the  properties  that  x(0)  =  Xq,  >c(T)  *  xj,  and 

d (x(t) ,Ra)  <  £7 V  t . 


Let  Va • Ea  denote  the  set  of  ail  almost  controlled  invariant  and  almost  controllabil¬ 
ity  subspaces  and  Va  ( K )  .  Ra  ( *' )  those  contained  in  a  given  subspace  K  of  X.  It  is  easy  to 
see  that  Va(K)  and  Ra (K)  are  closed  under  subspace  addition  and  hence  sup  Ra(K)  =  :  Rj  K  ^  — a 
and  sup  ya(K)  =:  V*#K€Va. 

In  [4]  we  have  given  a  large  number  of  equivalent  statements  for  almost  invariance. 
The  main  conclusions  are  the  following: 

(i)  €  V&  (almost  controlled  invariance) 

(ii)  V  €  V  +  R^  (every  Va  €  va  is  decomposable  into  a  controlled  invariant  and  an  al¬ 
most  controllability  subspace) 

(iii)  3F  and  subspaces  imB  ^  Bj  ^  B2  ^  * ‘  ’  3  Bn  such  that  Va  =  V  +  Ra  where  V  ^  V  with 
(A+BF)V<=V  and  Ra  f  Rj  with  Ra  -  B  ]  +  ( A  +  BF  )  B2+ •  ■  ■  +  (  A  +  BF)  n  1  Bn  (this  oroDerty  of  Ra 
may  be  interpreted  as  follows:  if  we  use  first  feedback  F  and  then  look  whici. 
states  are  reachable  using  delta  functions  6  in  the  direction  Bj,6  in  the  direc¬ 
tion  B2»  etc.  »then  the  reachable  set  is  B  j  +  ( A  +  BF )  B  2  +  *  *  *  +  ( A  +  BF)  n“  .  The  repre¬ 
sentation  of  Ra  shows  that  with  smooth  approximations  of  6 , h ,  •  •  -  > 6 ^n“ ^  we  can 
achieve  such  trajectories  approximately) 

(iv) (iv)  va €  VD  :=  the  1 distributionallv  controlled  invariant  subspaces*  (see  [4]:  Va  is 
holdable  using  generalized  functions  as  inputs) 

(v)  vfl  (  vc^osure  (how  this  closure  is  to  be  understood  is  explained  in  ^^Intuiti¬ 
vely  this  means  that  3V£  C  V  such  that  limv  =  V  .  Now  3F£  such  that  (A  +  BF  ) V  c 
V  and,  as  £*+(),  Fe“*°°  Va  £  Va  but  EVa  €aV:  high  gain  feedback) 


I 
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The  theory  of  almost  controlled  invariant  subspaces  is  very  apt  for  studying  the  sol¬ 
vability  of  convolution  equations.  Indeed,  let  K:=  kerH  and  consider  the  system  x  =  Ax  +  Bu; 
z  =  Hx.  If  we  ask  when  for  a  given  x^(O)  there  exists  u  :  IR+  “*  U  such  that  z  =  Hjc  =  0^,  we  ob¬ 
tain  the  equation 

z(t)  =  HeAtx{0)  +  /  HeA(t’T,B  u ( T )  dT  ( l  ) 

0  1 

or,  after  taking  Laplace  transforms 


Z(s)  -  H(IS-A)  'x(O)  +  H(Is-A)"'b  U ( 3 ) 


(/'i» 


Let  us  say  that  L  or  equivalently  L j  is  approximately  solvable  (in  the  L j  -  sense}  if  Ve  >  0 
3  u  such  that  tlzll  ,  ^  <  ^  .  The  following  holds: 

Proposition  4  :  { L  j  is  solvable)  *♦  ix.(0)€VjcerH)  **  {/,  j  is  solvable  with  U(s)  a  strictly 

proper  rational  function)  ^ 

and  {.L  j  is  approximately  solvable)  **  (jt(0)  €  kerH:*  Rb  kerH  +  v£erH 

with  Rb.kerH  :*  fcorH  +  *♦  { £  l  is  solvable  with  u  a  generalized 

function)  (L  j  is  Solvable  with  U(s)  a  rational  function}  **  { £  j  is 
approximately  solvable  with  U(s)  a  strictly  proper  rational  function). 


(ADDPM) 


In  this  section  we  take  a  look  at  the  problem  of  disturbance  rejection  to  any  arbitra¬ 
ry  degree  of  accuracy.  This  may  be  formalized  by  introducing  norms  on  the  spaces  of  to-be- 
control  led  outputs  and  disturbance  inputs  and  requiring  the  influence  of  d  on  z  to  be  arbitrari¬ 
ly  small  in  this  norm.  The  influence  of  d  on  z  is  given  by: 


z( t)  *  /  W(t-T)  d  ( T )  dT 
0 

with  W  the  closed  loop  impulse  response.  Taking  Lp-  norms  we  are  led  to  consider 
«>)  /  IldN^o  qo)  .  It  is  well-known  that  for  all  1  i  p<< 

by  the  norm  of  W,  i.e.  by  Jllw(t)lldt.  Actually  for  p  =  1  ,«>  this  is  the  exact  value 


suPd*0 

this  induced  norm  is  CoTInded 

while 


for  1  <  p < «  it  is  in  general  only  an  upper  bound.  It  may  be  shown,  however,  that  this  fact 
is  of  no  consequence  on  the  results  which  follow  and  that  for  all  p  approximate  disturbance 
rejection  requires  that  we  make  the  L j-  norm  of  the  closed  loop  impulse  response  small.  We 
will  hence  take  the  following  problem  formulation: 


(ADDPM) :  The  almost  disturbance  decoupling  problem  by  measurement  feedback:  Given 
aruf  any  t  >  0,  does  there  exists  Ic  such  that 

°r  e  A6 1 

Jll  w  <  t )  II  dt  <  e  ,  where  w  :  t -♦  H  e  g  ? 

0  e 


If  so. 


give  algorithms  for  computing 


from  I 

P 


and 


£  . 


I 


P 


If  goes  without  saying  that  one  can  also  here  add  internal  stability  requirements.  However, 
a  complete  survey  of  what  is  available  in  this  area  is  far  beyond  the  scope  of  this  lecture. 
We  will  therefore  not  consider  further  variations  here. 


The  above  problem  with  state  measurements  y  =  x  is  denoted  by  (ADDP)  and  its  solvabil¬ 
ity  is  given  in 


Theorem  5:{(ADDP)  is  solvable)  *•  { imGcV*  .  „} 

-  b,kerH 

The  proof  of  this  theorem  also  yields  a  conceptual  algorithm  for  computing  the  required 
feedback  gain  and  may  be  found  in  [4]. 


It  is  of  interest  to  compare  the  striking  parallel  between  Theorems  1  and  5.  As  such 
one  expects  a  similar  parallel  to  exist  in  the  context  of  disturbance  decoupled  estimation 
problems.  Indeed,  this  is  the  case.  The  duals  of  almost  controlled  invariant  subspaces  are 
the  almost  condi  tional  ly  invariant  subspaces.  There  are  the  subspaces  Sa  cz  X  which  are  such 
that  x ( mods a )  may  be  constructed  arbitrarily  closely  (in  a  sense  which  is  made  precise  in 
[S]) .  All  what  has  been  said  so  far  about  almost  controlled  invariance  may  be  dualized:  this 
yields  dis t r ibut iona 1 ly  conditionally  invariant  subspaces  (in  which  x(modSa)  may  be  tracked 
using  differentiators)  and  there  exists  an  infiraal  ;.j -almost  conditionally  invariant  subspace 
containing  a  given  subspace  K  of  X.  This  subspace  is  denoted  by  Sb  K.  This  notion  is  imme¬ 
diately  applicable  to  the  almost  disturbance  decoupled  estimation  problem  ( ADDEP ) .  This 
problem  is  the  same  as  the  one  in  the  section  headed  by  (DDEP)  but  now  we  ask  for  conditions 
such  that  there  exists,  V  t>0,  an  observer  £0  such  that  the  J  j-norm  of  the  impulse  response 
d  •-*  e  =  z-i  is  less  than  £.  The  main  result  is  [  S  ]  a 

Theorem  6:  {(ADDEP)  is  solvable }  f  {S*  c  kerH) 

-  b , i mG 

We  now  arrive  at  (ADDPM).  In  this  case  some  sort  of  separation  theorem  holds  in  the 
sense  that  (ADDPM)  is  solvable  if  and  only  if  (ADDP)  and  (ADDEP)  are  both  solvable.  Hence 
we  should  be  able  to  achieve  almost  disturbance  decoupling  by  state  feedback  and  we  should 
be  able  to  reconstruct  the  to-be -con tro 1  led  output  arbitrarily  closely  from  the  measured 
output  (this  is  the  main  result  of  [ 5 1  )  : 

Theorem  7:  {(ADDPM)  is  solvable)  **[i  mG  c:  V*  ,  and  S*  c  kerH) 

-  b  ,  ke  r  H  b,imG 


SOME  SPECIAL  CASES 


There  are  a  number  of  situations  in  which  it  is  possible  to  conclude  the  solvabil¬ 
ity  or  the  generic  solvability  or  (DDP) ,  (DDPM) ,  etc.  without  having  to  go  through  the  ex 
plicit  calculations  of  vJ[erH»  v£  kerH'  e^c*  With  ’generic'  solvability  we  mean  the  follow 
ing:  consider  the  dimensions  of  X,U,D,y  and  Z  as  fixed  but  the  entries  of  the  matrices 
(A,B,C,G,H)  as  generated  'at  random’.  The  intuitive  content  of  the  notion  of  generic  solv 
ability  is  the  following;  if  for  'almost  all'  (A,B,G,C,H)  the  solvability  (e.g.  of  (DDP)) 
holds,  thin  we  say  that  (DDP)  is  gene; i cally  solvable.  For  the  precise  mathematical  defi¬ 
nition  of  this  concept  in  the  present  framework,  v,e  refer  to  [4]  . 

We  have  the  following: 


Assume : 

Then  generic  solvability  of: 

holds  iff: 

A  ,  B , C  arbitrary 

(DDP) 

ra  >  £ 

H,  G  subject  to  HG  =*■  0 

(DDPPP) 

m>£ 

(DDEP) 

p  >  q 

(DDEPPP) 

p  >  q 

(DDPM) 

m  >  £  and  p  >  q 

(DDPMPP) 

m  >  £  and  p  >  q 

A,B,G,C,H  arbitrary 

( A  DDP ) 

m  >  £ 

(ADDEP) 

p  >  q 

(ADDPM) 

ro  >  £  and  p  >  q 

Assume  : 

Then  solvability  of; 

holds  if: 

(ADDP) 

imG  c imB 

(ADDEP) 

kerC  c kerH 

(ADDPM) 

imG  c imB  and 

kerC  c  kerH 

RkerH  *  <°>  , 

(i.e.  H(Is-A) "  B 
is  left  invertible) 

(ADDP) 

dim  imB >  codim  kerH 

N?mG  =  x 

(ADDEP) 

dim  imG  £  codim  kerC 

(i.e.  C(ls-A)*  G 
is  right  invertible) 

RkerH  “ 

(ADDPM) 

dim  imB  >  codim  kerH 

"ImG  =  X 

dim  imG  < codim  kerC 

From  the  above  statements  the  importance  of  the  conditions 


#  of  controls  >1  of  to-be-controlled  outputs 

#  of  measured  outputs  >_#  of  disturbances 


may  be  seen.  We  call  this  the  law  of  requisite  variety  in  control  action  and  measurement 
capability.  Thus  good  control  is  only  possible  if  this  variety  is  sufficiently  large. 

In  4,5  a  complete  analysis  is  made  for  the  scalar  case  (m  =  q  =  p  * £  =  1)  where 
the  solvability  of  the  various  disturbance  decoupling  problems  may  be  decided  from  very 
simple  conditions  on  the  degrees  of  the  polynomials  involved  in  the  transfer  functions 
(u,d)  (y,z)  . 


APPLICATIONS  TO  ROBUST  CONTROL  SYNTHESIS 


In  this  section  we  return  to  the  problem  of  robust  controller  synthesis  introduced 
in  the  beginning  of  this  lecture  (see  Figures  2  and  4(b)).  We  assume  that  the  plant  is 
given  by 


£  ;  x=Ax+Bu+  Gd  +  G'd' 

P  y=Cx,  z=Hx,  z'  =H'x 


with  d  Mi*'  external  disturbance,  z  the  to-be -con  t  ro  L  l  ed  output,  d'  the  internal  disturbance 
uni  the  infernal  >J  i  s  t  u  rba  ncc  loop  input;  z‘  generates  d'  by  means  of  a  iy  nautical  sys~ 
f*-™  t ^ t  wh  i <■ -h  may  be  nonlinear,  uncertain,  time-varying  and  which  may  incorporate  un¬ 
certain  h o n 1 inearities ,  parameter  variations,  and  other  such  unpleasant  things. 


L  o  o  p 


From  the  (DDF’ M)  rosul 
if.  will  be  possible  f  o 

features  if  S  * 

imG  +  KD(i 


t  s  it 
muk  e 
c  V  * 


f  o 1 1 ow  s  that 
z  i  n  a* ’ii  s  i  f  »vc 

r  H  * 


,  i  f  we 
l  o  bo  t  h 


disregard  stability  considerations, 
the  external  and  internal  uncet  ;.<iin 
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If  we  require  however  that  at  the  same  time  internal  stability  should  hold  for  all  finite 
gain  /,  -  input/output  stable  systems  then  a  sufficient  condition  is  S  %  f  i  mG+  i  mG  *  c 

vg,kerH  kerH**These  conditions  are  verifiable  by  finite  algorithms  and  yield  a  feedback 
compensator  which  may  be  used  in  order  to  achieve  these  robust  control  features.  Note 
that  the  special  cases  (suitably  adapted)  given  in  the  previous  section  are  of  much  inte¬ 
rest  in  this  context  as  well. 


The  above  conditions  yield  designs  in  which  the  same  feedback  compensator  may  be 
used  for  any  F^ist*  If  we  however  rephrase  the  problem  by  asking  when  for  all  f,  which 

are  finite  gain  -  input/output  stable  with  gain<K<°°  there  should  exist  a  feeclEack 
compensator  (  whiBh  may  hence  depend  on  K)  such  that  the  closed  loop  systems  remains  sta¬ 
ble,  then  we  may  use  the  theory  of  almost  invariant  subspaces  and  we  arrive  at  the  follo¬ 
wing  conditions: 

(i)  if  the  initial  conditions  are  zero  (hence  to  some  extent  disregarding  internal 
stability  considerations)  then  a  sufficient  condition  for  the  existence  of  such 
a  compensator  are 


, ,  c  kerH  fi  kerH ' 


imG  +  imG '  c:  V* 


(ii)  if  we  also  require  internal  stability  and  take  y  =  x  the  it  suffices  to  have 


imG  +  i mG ’ c  V* 

y » 


kerH  n  kerH  '  +  Rb , 


kerH  n  kerH  ' 


We  close  this  section  by  referring  to  [7]where  results  on  robust  controller  syn¬ 
thesis  are  derived  in  a  context  which  is  more  general  than  the  one  considered  here  since 
it  allows  more  than  one  uncertain  loop  in  the  system  and  in  addition  considers  white  noise 
stochastic  disturbances.  The  results  obtained  inf  7] use  the  same  geometric  concepts  exposed 
in  this  secture,  but  arrive  at  results  which  exploit  the  structure  of  the  system  in  a  much 
more  subtle  and  effective  manner. 


APPENDIX 


rhe  most  basic  relevant  algorithms  in  this  area  are  the  following 


( ISA) :  V 

( ACSA)  :  R 

(  ACS  A  )  '  :  S! 


,k  ♦  1 

*  kerH  f) 

A-‘<Vk 

kerH 

kerH 

k  1 1 

k 

kerH 

*  kerH  n 

(ARkerH  + 

k  +  1 

k 

kerH 

=  i  mB  + 

A  Ske  r H  ^ 

KerH  *  ,0} 
SkerH  *  ,0} 


These  algorithms  attain  their  limits  mono ton i ca 1 ly  (hence  in  a  finite  number  of  steps  < 
diraX)  and  we  have:  ~ 


VkerH  VkerH'  Iciw  RkerH  Ra,kerH'  Ji”  (VkerH 


RkerH>“  V*a,K'- 


lim  (V  n  u)  =  R?  u;  and  lim  S, 

k-*oo  kerH  kerH  kerH  K-*  oo  kerH 

The  dual  algorithms  are  the  following: 


k  k 

S.  „  "  (AR.  T,  +  imB)  =  R*  , 

kerH  k-»  °o  kerH  b,kerH 


(CISA)  : 
(ACOSA)  : 
(ACOSA)  1  : 


=  imG  +  (A  ^  N .  .)  0  kerC; 

imG  imG 

V.+  =  kercfl  A  1  (V*  „  +  imG)  ; 

l mG  imG 


s:.c  ■  <»> 

-  - 


These  algorithms  attain  their  limits  monotonica 1 ly  (hence  in  a  finite  number  of  steps  ^ 
dimX)  and  we  have 


lim  S.  _  *  S* 
k-»«>  imG  imG 


lim  (S  +  N  )=n* 

k-»  oo  i  mG  l mG  t  mG 


N*  .  r. 
a  ,  i  mG 


;  lim  (S.  ON.  )  =  S*  .  ; 

k-»oo  imG  imG  a,iraG 

Vk  .  =  lim  (fl-1Nk  )  n  kerC  =  S*  . 
i  mG  K-*  oo  l  mG  b,n 
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SUMMARY 


These  notes  present  in  summary  fashion  the  essential  results  of  the  author'3 
work:  Linear  Multivariable  Control:  A  Geometric  Approach  (Second  ed.,  Springer- 

Verlag,  1979) ,  together  with  some  more  recent  developments  on  nonlinear  systems. 
Part  I,  Linear  Systems,  deals  in  turn  with  fundamentals,  disturbance  decoupling 
and  output  stabilization,  controllability  subspaces,  noninteracting  control,  and 
regulation  and  tracking.  Part  II,  Nonlinear  Systems,  summarizes  structurally 
stable  nonlinear  regulation  with  step  inputs,  and  concludes  with  a  version  of  the 
Internal  Model  Principle  for  controlled  sequential  machines. 


PART  I  -  LINEAR  SYSTEMS 

1.  LINEAR  MULTIVARIABLE  SYSTEMS.  FUNDAMENTALS . 

1 . 1  State  Space  Description 

x  ( t )  =  Ax(t)  +  Bu(t>  ,  t  4  0 
(or  x(t+l)  =  Ax(t)  +  Bu(t),  t  =  0,1,2,...) 

x  =  state  vector  i  X  =  IRn 
u  =  control  vector  e  U  =  Ir"1 
A:X  >  X,  B:U  *  X 
External  control:  u  =  v(t) 

S  ta  te  feedback  control:  u  =  Fx  It),  F:X  -*  U 

Combined  control:  u  =  Fx  +  v, 

x(t)  =  (A+BF)x(t)  +  Bv  ( t ) 

System  outputs: 

y(t)  =  Cx(t),  C:X  ■»  V  =  W?  (measured  output) 

z(t)  =  Dx(t),  D:X  ♦  Z  »  IR^  (regulated  output) 

1 . 2  Signal  Flow  Graphs 

x  =  Ax  +  Bu 
u  =  Fx  +  v 
z  =  Dx 

Take  Laplace  transforms: 

x(s)  =  (sl-A)  ^Buts) 
u(s)  =  Fx(s)  +  v(s) 
z  ( s )  =  Dx  ( s ) 
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i 

I 

l 

I 

l 


1 . 3  Controllability,  Feedback  and  Pole  Assignment 

x(t)  =  Ax ( t)  +  Bv ( t }  ,  t  ^  0 

x(0)  =  xQ 

t 

x(t)  =  etAxQ  +  |  T'ABv(t)<3t 
0 

What  states  can  you  reach  at  t  =  T  (say)  by  appropriate  selection  of  v(t), 
0  s  t  s  T  ? 

Let  Xg  =  0  . 

n  . 

Have  er  =  ]  U'  (  t  )  A 

r  =  l  r 

Then 

n  T 

x  (T)  =  l  !{/  (T-T)Ar_1Bv(T)dT 
r=l  J  r 
r  1  0 

Let  R  :=  f  x (T) :  v ( - )  continuous  on  [ 0  ,  T ) } 

Fact:  R  =  Im[B  AB  ...  An_1B] 

Let  8  :=  Ira  B  =  BU  c  X 
Then  R  =  8  t  AS  t  ...  +  An_18 
=  :  <  A | 8  >  <  X  . 

(A,B)  is  controllable  if  <  A I  8 >  =  X  . 

Suppose  ^  A  j  8  >  /  X  . 

Let  P:X  •»  X  :=  X/>A[8>  . 


Write  X  =  Xt  ®  X2 


Xx  =  -A|B>,  X2  =  X  . 


[a. 

A, 

B,  n 

A  =  1 

3| 

L  B  - 

1 

Lo 

A2^ 

-  0  . 

The  signal  flow  exhibits  the  system  decomposition. 


i 
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B  (sl-A,)"1 

U  - - - - -«t - = - X>  X, 


(sl-A2> 


controllable 


uncontrollable 


1.  Fact:  (VF)  <A+BF|8>  =  <A|B> 

i.e.  Controllable  subspace  <A]8>  is  invariant  under  application  of 
state  feedback. 

2  .  Let  dim (X)  =  n  . 

Consider  a (A+BF) 

a  =  symmetric  set  Ap,  of  n  complex  numbers 
Fact :  a (A+BF)  =  poles  of  (sl-A-BF)  ^ 

Theorem  ("Pole  assignment"): 

(A, 8)  is  controllable  [i.e.  <A|8>  =  X) 
iff 

(VA  c  (C,  symmetric,  1 A  |  =  n)(3F:X  +  U)  :  o(A+BF)  =  A. 


Stabilizability 

Split  complex  numbers  I  =  0  (g,b  mean  ’good’,  'bad') 

e.g.  <T  =  (s:  Re  s  <  Ol 

g 

A:X  -*  X  is  stable  if  a  (A)  c  <£  . 

-  g 

(A,B)  is  stabilizable  if  3F:  A+BF  is  stable. 

Let  characteristic  polynomial  of  A  be  n  (s) . 

Factor  it  (s)  =  v  ( s )  tt  ( s )  ,  zeros  of  it  in  I  ,  etc. 

g  b  g  g 

Unstable  modes  of  A  is  subspace 

X,  (A)  =  Ker  +.  (A)  . 
b  b 

{X  =  X,  (A)  ®  X  (A)  ) 
b  g 

Proposition.  (A,B)  is  stabilizable  iff 

X.  (A)  t  < A j  8 >  , 
b 

i.e.  unstable  modes  are  controllable. 


2.  DISTURBANCE  DECOUPLING  AND  OUTPUT  STABILIZATION 
2 . 1  (A, B) -Invariant  Subspaces 

x  =  Ax  +  Bu  +  Sq 
z  =  Dx 

q  =  q ( * )  is  an  exogenous  disturbance 


Temporarily  set  u  =  0.  Let  S  ■  In  S.  Disturbances  act  in  subspace 
'A|s>.  Output  z ( • )  is  decoupled  from  q(-)  iff 

D-A|S>  =  0  . 


Now  let  u 


Fx  . 
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DDP:  ?3F :  D<A+BF |S>  =  0. 

i.e.  ?3F:  D(A+BF):I"1S  =  0,  jcn  . 

Nasty  nonlinear  problem  in  F! 

Recast  in  geometric  form 

?3F:  <A+BF[S>  c  Ker  D 

The  foregoing  problem  motivates  the  introduction  of  (A, B) -invariant  subspaces 
Let  P  (=  1/  )  =  <A+BF|S>  . 

r 

Note:  (A+BF)P  c  l/. 

Problem:  Given  A,B(  and  any  subspace  X 

?3F:  (A+BF) P  c  P 
Call  such  P:  (A,B)  -  invariant . 

Fact :  P  is  (A, B) -invariant  iff 

AP  c  P  +  8. 

Notation:  I_ ( X )  =  {P:  AP  c  1/+8} 

£(Kec  D)  =  {V:  P  c  Ker  D  &  AV  c  P+B) 

Note:  0  <  [(Ker  D)  /  0  . 

2 . 2  Solution  of  Disturbance  Decoupling  Problem 
Proposition.  DDP  is  solvable  iff 

3P:  P  t  MKer  D)  &  P  3  S  . 

(Proof.  (If) 

Let  F:  (A+BF)  Pci/. 

Then  <A+BF|S>  c  <a+BF|P> 

=  P  +  (A+BF) P  +  . . . 

=  P  c  Ker  D. ) 

Proposition.  Pj,P2  €  L*Ker  ^1*^2  e  L*Ker 

Proposition.  3!  P*  t  MKer  D)  : 

(VP)  P  «  M  Ker  D)  =*  P  c  P* 

Write:  P*  =  sup  MKer  D)  . 

Theorem 

DDP  is  solvable  i ff  P*  =>  S  . 

Algorithm 

P°  :=  Ker  D 

P^+1  :=  Ker  D  n  a'V’+B)  ,  j  =  0,1,2,... 

Pn  =  P*  . 

Signal  Flow 

Let  X  =  X  ®  X2 
X  =  P*,  X2  2  X/P* 
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A+BF  = 

IA1  A3_ 

D  =  [0  D.)  ,  S  = 

Pll 

.0  a2. 

0 

Theorem 

Let  V*  :=  gup  IJKer  D)  . 
OSP  is  solvable  iff 

Xfa(A)  c  < A |  8^  +  l/*. 


De 


t (A+BF) 


has  all  its 


3.  CONTROLLABILITY  SUBSPACES 

3.1  Controllability  Subspaces :  Definition  and  Characterizations 


x  =  Ax  +  Bu 

R  <-  X  is  a  controllability  subspace  (c.s. ) 
if  3F:X  ■+  U  6  G:U  ■*  U  such  that 
R  =  <A+BF 1 Im(BG) > 


F 


Notation  C(X) ,  C(Ker  D) 

Fact:  Rj  ,R2  t  C(Ker  D)  <  C(Ker  D) 

Fact :  3!  R*  =  sup  C(Ker  D) 

Fact:  0  <  R*  r  V*  c  Ker  D  c  X  . 


Notation 

F(V*)  :=  i  F:  (A+BF)  V*  c  l/*}  Say:  "F  is  a  friend  of  l/*" 


F ( R * )  :=  F:  (A+BF) R*  c  R*) 


Fact :  F  ( 1/  * )  F(R*).  "Any  friend  of  V*  is  a  friend  of  R* 


»  (A+BF)jR* 


l/*/R* - - - >V*/R* 


Transmission  Zeros 

Fact :  Ap  is  the  same  for  all  F  <  F(V*> . 

Jargon:  o (A^  is  the  set  of  transmission  zeros  of  (D,A,B) . 

Disturbance  Decoupling  Problem  with  Stability  (DDPS) 

,A+BF  is  stable,  & 

DDPS:  ?3F  :  ^ 

A  +  BF | S •  '  Ker  D 

M 

I 


Theorem 

(i) 

&  (ii) 


DDPS  is  solvable  iff 
( A , B )  is  stabilizable 
I/*  i  S  , 

g 


NONINTERACTING  CONTROL  (DECOUPLING) 

x  =  Ax  +  Bu 

z  .  =  D  x  i  <•  k 

11  — 

k 

Let  u  =  Fx  +  7  G.v. 

i=l  1  1 

1  Restricted  Decoupling  Problem  (RDP) 

?3F,  (it  k)  such  that  controls  z^  without  affecting  z^,j  ^i 
Let  R  =  'A+BF| Im(BGi) > 

RDP  can  be  stated: 

73  c. s .  R .  ,  i>k: 

l  - 

n  FIR  )  /  0,  D. R.  =  Iro  D. , 
i=l“  1  i  1  i 


l 


Unsolved.  Add  more  state  variables 


R  .  r  n . Ker  D.  . 
1  3 
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4 . 2  Extended  Decoupling  Problem  (EDP) 


Adjoin  auxiliary  dynamics 


x  =  B  u  ,  x  «  X  ,  ucU. 
a  3  a  a  a  a  a 


Extended  spaces  X  =X®X,  U  =  U  ®  U  . 

e  a  e  a 

A  Ol  [BO' 

Extended  maps  A  =_  ,B=  ,  D.=(D.  0) 

e  0  0  6  0  B  16  1 


EDP  Find  ng  and  extended  c.s.  (it  k)  such  that 

S .  c  n  Ker  D.  ,  n  F  (S.)  f  0  , 

1  j*i  iek-e  i 

Si  +  Ker  Die  ~  Xe  '  1 £  -* 

Theorem  EDP  is  solvable  iff,  for  the  original  RDP, 
R*  +  Ker  D.  *  X,  if  k. 

l  l  — 

Then  can  choose 

k  k 

na  =  l  d  ( R*)  -  d(  [  R*). 


4 . 3  Generic  Solvability  of  EDP 
Regard 

|3  * =  (D^»  . . . .D^,  A,B) 

N 

as  a  data  point  in  1R  , 

N  =  (q^+.-.+q^Jn  +  n2  +  nm. 

Fix  q,  ,  . . ,  ,q,_,n,m.  EDP  is  generically  solvable  if  solvable  for  all  p  off 

1  K  n 

some  proper  algebraic  variety  in  ]R  . 


EDP  is  generically  solvable  iff 


£  q  <  n 
i  =  l 


(generic  noninteraction) 


i!l  *  \  q.  -  min  q.  (generic  output  controllability) 

i=l  1  1  si  <k  1 

(if  either  condition  fails,  EDP  is  generically  unsolvable!) 

Then  can  take,  generically, 

k 

n  =  (k-1) (n  -  l  q  ) 
a  i=l  1 


REGULATION  AND  TRACKING 


x  =  Ax  +  Bu 


y  =  Cx  (measured) 

2  =  Ox  (to  be  regulated) 
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5 . 1  Regulator  Problem  with  Internal  Stability  (RPIS) 

Find  (dynamic)  feedback  from  y,  i.e. 
u(s)  =  F(s)y  (s)  , 
such  that 

(i)  All  controllable,  observable  modes  are  stabilized. 

(ii)  Output  z  is  regulated,  i.e.  for  all  i.c.,  z(t)  -*  0  as  t-*-1”. 


x2  =  A2x2 


(Typical  format: 

X1  =  A1X1  +  A3X2  +  B1U  ' 

(A^.B^)  -  controllable  plant 

A2  -  exosystem,  A^  -  disturbance  coupling) 

Admissible  controls 

Could  try  static  output  feedback,  i.e. 

u  =  Fx,  F  =  FC  or  Ker  F  = Ker  C. 
Intractable! 


Better 

Ker  C 


y  =  C  x  , 
new  new 


new 


=  N  :  =  n  Ker  C  A 
i  >1 


i-1 


u  =  Fx,  Ker  F  :>  N  . 

Tractable!  since  AN  c  N  . 

This  is  algebraically  equivalent  to  using  a  dynamic  observer: 


u 

y 


Observer 


x  mod  N 


Internal  stability 

Any  observable,  unstable  modes  should  be  uncontrollable. 
I  =  (s:  Re  s  :0)  0  {s:  Re  s<0) 

=  <t+  u  (t" 

Require: 

X+ (A+BF ) +N  <Al8>+N 

— 17 - n  4r —  =  0 

or 

X+(A+BF)  n  ( < A | 6>+N)  c  N  . 

Output  regulation 

Bury  unstable  modes  in  Ker  D,  namely  arrange  that 
X+(A+BF)  c  Ker  D 

RPIS 

Given  A:X  -»  X,  B:U  *  X,  D:X  -♦  l,  and  N  c  X  with  AN  c  H . 
Find  F:X  »  U  such  that 

(i)  Ker  F  3  N 

(ii)  X+(A+BF)  n  (^A|8>+N)  <  N 
(Hi)  X+  (A+BF)  t  Ker  D 


8-9 


Theorem 

RPIS  is  solvable  iff  3R  c X  such  that 

(1)  V  c  Ker  D  nA_1(W) 

(2)  X+(A)  nil  +  A(VnN)  c  V 

(3)  V  n  (<A| 8>+N)  c H 

(4)  X+(A)  c < A | 8>  +  V 

5 . 2  Separation  Theorem  for  RPIS 

Extra  dynamic  compensation  (beyond  the  observer)  does  not  help! 


5 . 3  Solution  of  RPIS:  Geometric  Structure 
Assume  first  that  N  =  0. 

V*  :=  sup  MKer  D)  ,  R*  :=  sup  CJKer  D) 

Recall:  F((/*)  c  F  ( R* )  . 

If  AF  :=  A+BF,  F  e  F(V*) ,  then  A^.,  induced  on  V*/R*(  is  the  same  for 
all  F  £  F ( V*) . 

Let  AT  c  T  and  AR  c  R  c  T. 

Def .  R  decomposes  T  wrt  A  if  3  S  c  T  ,  R®S  =  T ,  A  S  c  S. 

Fact  R  decomposes  T  wrt  A 

iff  {e.d.  A |  R }  u  {e.d.  of  A  on  T/R!  =  {e.d.  of  A|T) 

iff  A^V  -  VA2 + Aj  =  0  is  solvable  for  V. 

Theorem 

Let  N  =  0.  RPIS  is  solvable  iff 

(i)  X+(A)  c  <a|8>  +  V* 

& 

(ii)  In  X/R*,  with  F<r(l/‘), 

V*  n  X+(Af)  n <A|B>  +  R* 

_ 

decomposes 

V*  n  X+(Af>  +  R* 

R* 

wrt  map  A^,  induced  by  Ap  in  V*/R*. 

(Condition  needs  to  be  checked  only  at  one,  arbitrarily  selected,  F  £  F ( V* ) . } 
Proof  of  Theorem  (if) 
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V  n X  (Ap) 


V*  n  X+(Ap)  n  <a',B> 


by  decomposability 


l!  has  required  properties 


Corollary  1 


Assume:  R  =  0 


&  of  induced  map  A  on  X/<a|B>)  c  j 

RPXS  solvable  iff 
(i)  <A|  B>  +  V*  =  X 

&  (ii)  V*  h<a|8>/R*  decomposes  V*/R*  wrt  Ap  induced  on  V*/R*. 


Let  X  :=  <A|B>,  ax  :=  A|<A|8>,  etc. 


o*  :=  of  induced  map  on  V*/R* ) 


Corollary  2 


Assumptions  as  in  Corollary  1. 
RPIS  solvable  provided 
(i)  <Aj8>  +  V*  =  X 


&  (ii)  o*  n  o (A)  =0 


Example  (RPXS  not  solvable) 


R*  =  0  , 


AFltr  ~ 


V  ,■  --A  ;  6>  = 


Decomposability  fails! 


General  solution  of  RPIS  (K  ?  0) 
Theorem.  RPIS  is  solvable  iff 


(i)  X  (A)  r,  N  -  Ker  D 


(ii)  In  X  :=  X/X  (A)  n  N,  the  reduced  RPIS  is  solvable, 
i.e.  3F: X  >  U  such  that 


X  (A+BF)  ■  Ker  5 
&  X+  (A  +  BF)  i,  -A|  S>  -  0  . 
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5 .4  Solution  of  RPIS:  Linear  Maps  Criterion 
Assume  (i)  o(exosystem)  c  (t+ 

(ii)  X+  (A)  n  II  =  0 
(iii)  D<A|S>  =  Z 

Theorem 

RPIS  solvable  iff  31/  c  x  such  that 
V  c  Ker  D  n  A-1  (  U+8 )  , 

&  <A| 8>  ®  V  =  X  . 

Let  P:X  -  X/<A|  8>  =:  X  . 

Corollary 

RPIS  solvable  iff  3  maps  V:X  -*■  X  4  K:X  ■*  U  such  that 
AV  -  VA  +  BK  =  0  ,  DV  =  0  ,  PV  =  1  . 


5 . 5  Well-Posedness  and  Generic  Solvability 
Coordinatization 


f  A  A,] 

B.  1 

1  3 

,  B  = 

1  1 

l  0  a2 

o  1 

C  =  (Cx  C2) ,  D  =  (Dx  D2)  . 

A^nj  «  nj,  A2:n2*n2,  B^in^xm,  D^qxnj 


(A^ ,B^)  controllable  •  plant 

o  (A2)  c(i+  •  exosystem 

Rank  -  q  (full  rank) 

Data  point  £  =  (A^.A^.B^)  e  IRN  , 

2 

N  =  n^  +  n^n2  +  n^m  . 

N 

Def .  RPIS  is  well-posed  at  £  if  solvable  throughout  a  nbhd  of  £  in  IR 

Def .  Fix  n^,  m,  q.  RPIS  is  generically  solvable  if  solvable  for  all 

£  e  IR  ,  off  some  proper  algebraic  variety. 

Theorem 

RPIS  is  well-posed  at  £  =  ( A^  A^  B^)  iff 

(Aiei^  -  118A^)  Ker(D1«l^)  +  ImfB^l^)  =  Xj  ®  X^  . 

Corollary 

RPIS  is  generically  solvable  iff 

m  (=  no.  of  controls)  z  q  (  =  no.  of  regulated  outputs) 

If  m <  q,  no  data  point  is  well-posed. 

Theorem 

Assume  also:  (D^.A^)  observable 
4  o (A^)  n  o ( A2 )  =  0  . 
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RPIS  is  well-posed  at  (A^  A^  B^)  iff 
H1(i)  :=  D1(ll1  -  A1)_1B1 
is  right-invertible  (over  <t)  at  every  X  c  o  ( A2 ) 

Example  (RPIS  solvable  but  ill-posed) 


S  +S+1  s+1 


If  you  perturb  the  branch 


then  RPIS  becomes  unsolvable. 


5 . 6  Strong  ('Robust')  Synthesis 

Assume  RPIS  well-posed,  and  C  =  D. 


( S-A1 ) 


(s-a2) -1 I  d2 


De f  ■  (F^.A^.B^)  is  a  synthesis  if 

(i)  ( F  ,  A  )  observable  and  (A  ,B  )  controllable 

c  c  c  c 

(ii)  Loop  is  stable 

(iii)  Output  regulation  holds. 

Def.  Let  p  =  (A, rA,,B, I .  Synthesis  is  strong  at  p  if  properties 

1  i  1  w 

(i)  -  (iii)  hold  throughout  a  nbhd  of  g  in  F  . 

Theorem  Let  S  =  (F  ,A  ,B  )  be  a  synthesis. 

-  c  c  c  c 

Let  «2(X)  be  m.p.  of  A^. 

Let  T:T  *  T  be  cyclic,  with  m.p.  i ^ . 

Then  Sc  is  strong  at  d  iff 

3J:I  0  Z'  Xc,  injective,  such  that  diagram  commutes. 


T  0  Z  ' - : 


Say:  "A^  incorporates  an  internal  model  of  A2" 


I 
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Theorem 

RPIS  admits  a  strong  synthesis  at  £  iff  it  is  well-posed  at  £. 


5 . 7  Internal  Model  Principle 

Assume  y  =  Cx,  z  =  Dx, 

but  no  a  priori  relation  between  C  and  D. 

"Synthesis"  as  before,  but  now  with  y  as  input. 


"Strong"  as  before,  but  with  certain  variations  in  Bc . 


Theorem 

Synthesis  is  strong  only  if 

(i)  Ker  C  c  Ker  D  fi.e.  z  =  Ey,  some  E) 

&  (ii)  "A  incorporates  an  internal  model  of  A.,  that  is  controlled  by  z". 
c  ^ 


PART  II  -  NONLINEAR  SYSTEMS 


6.  STRUCTURALLY  STABLE  NONLINEAR  REGULATION  WITH  STEP  INPUTS 

Let  X  -  smooth  (C  )  manifold 
TX  -  tangent  bundle  of  X 
C,:X  •*  TX  vector  field,  complete 
System  is  a  pair  (X,f). 

Call  (X,0  the  regulator  (system). 

Bring  in  auxiliary  system 
(V,n)  -  exosystem 

Assume  3  surjective  submersion  n:X  V  such  that  we  have 


Assume:  (X,f)  admits  a  subsystem  'induced'  by  (V,n),  in  the  sense: 


Assume:  k  is  a  regular  embedding,  defined  uniquely  by  (1),  viz. 

foj  —  v  ^  o  i]  ,  —  id. 

In  applications  one  might  often  require  that  v(V)  be  a  stable  attractor 
for  the  flow  determined  by  C  on  X,  viz. 

(VxQ)x(t,x0)  »  v(V)  as  t  ♦  “}  . 

Bring  in  a  'good'  submanifold  (K,*) , 

K  — *  X  . 

K  is  where  you  would  like  to  be. 


?E.g.  . <K>  -  *-1(*0) . 

where  S :X  »  Z  is  an  output  map,  z^.  Z  a  setpoint.) 

Output  regulation  condition 
v  (V)  ■  i-  (K) 

So  a  embedding  1 :V  *  K  such  that  we  have 


Regulator  structure 
Assume 

TX  =  E„  ®  E  ®  E 
0  c  e 

where  the  vector  bundle 

Eq  is  identified  with  'plant' 

E  is  identified  with  'controller' 
c 

E^  is  identified  with  'exosystem' 

Thus 

E.  ®  E  =  Ker  it 
0  c  * 

=  TV 

*  e 

f'  =  S0  *  ^c  ®  f’e 

Regard  f  ,  f,  ,  ” ,  r  as  fixed, 
u  c  e 

Consider  variable  to  near  in  r  ( EQ )  . 

Assume  (1)  holds  near  i.e.  to  each  there  corresponds  v  such 

that  (1)  holds  with  C  =  'f'c' 'e* ' 

Say:  Regulation  is  structurally  stable  if  (2)  is  preserved  as  well. 

Namely 

(3  nbhd  H  of  f,Q)  (V?Q) 

?g  f  Nq  ==>  (31  )  V  =  K  o  1  .  (3) 

(Assume  Whitney  C  -topology  for  r(Eg)) 

Problem:  Assuming  structural  stability,  what  can  we  infer  about  the 

controller  C.  7  Consider  only  the  special  case  of 

Step  Inputs 

Assume  o(v)  =  <  T^V,  V’V. 

(1)  n  f  (x)  =  O  ,  . 

*  "  (x) 

Notation:  t,:X  •  E  is  zero  section; 

let  Z  =  r.  (X)  =  f  (x,  0)  :  x.  X1  . 

Structural  stability,  plus  (2)  &  (’),  mean  that 
ioroT(v)  =  0-(v). 

»  r  0T  (V)  •  Z 

X 


V<  V 
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C  Cqo* ° ' (v)  c  zr 
F  .foTlV)  < 


(i)  *c  *  zc  | 

(ii)  (£,  •  *)  1 ( Z  )  is  a  submanifold  of  K.  > 

c  c 

By  'density'  theorems, 

3^  such  that 

V*  *  Z0 

U0.O  I  (tc«.K)'1(Zc)  *  z0 

(To  justify  this  step  in  detail  need 
Lemma:  let  K  c  X,  Z  c  y  closed,  embedded  submanifolds. 


(f:  t  f  |  K)  S  Z) 

is  open  and  dense  in  C  (X,Y). 

Proof  (sketch):  Start  with  g  =  f ] K ,  in  a  canonical  coordinate  patch. 
Give  g  a  small,  polynomial  perturbation  g  i — >  g,  to  achieve  g  S  Z. 

Smooth  to  get  f=gonK,  f  =  f  off  a  nbhd  of  K.  Technique  as  in 
Golubitsky  &  Guillemin,  p.  55,  proof  of  Thom  transversality  theorem.) 

Note  that  if  (6a)  holds  and  (K)  n  Z^  /  0,  as  is  assumed  by  structural 

stability,  then  necessarily  nQ  s n-q,  viz.  q < m+n^. 

In  (6b)  ,  write 

K  :=  (C  «<)_1(Z  ) 
c  c  c 


( D<b )  (T  K  )  +  T.  ,  ,  Z  =  T  .  E. 
x  c  <Mx)  0  <t>  (x)  0 

-  TXKC  +  (M)'\(X)Z0  =  TxK 

—  Kc  *  #-1(Z0) 


(f.c-)'1(zc)  *  <VK)~1(V 


Introduce  dimensions 


m  n+nrt  n+n  n-q 
0  c 


Y0  :=  dim  (Zq) 

Yc  :=  dim  (tc=x)-1(Zc) 


codim  (6qok)  (Zq)  ■  codim  (Zg) 
n  -  q  -  Yn  =  dim  (En)  -  dim  (Zn) 


(n+n0>  -  n 
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Yo  =  n-q-no 

=  m  +  n  -  q 
c 

(7) 

(6c) 

dim  [  (f.cor)_1(Zc)  n  (r,Qo<  )  _1  (Z  )  ) 

=  max  (0,  Yg+Yc-(n~q)) 

(8) 

(4) 

dim  f.  .  .1  >  dim  i (V)  =  m 

(9) 

(8)  4  (9) 

— 

Y0+Yc‘n+q  “  m 

(10) 

(7)  4  (10) 

— 

Y  z  n-n 
c  c 

(11) 

But 

Yc  ^  n-q 

(12) 

11)  4  (12) 

— 

MAIN  RESULT:  n  i  q  . 

c 

(13) 

Interpretation: 

The  control  section  reduces  to  zero  on  a  certain  submanifold  of  K,  namely 
Kc<  of  dimension  >  ,  where  yc  ?n-nc.  By  (13)  at  least  q  'components'  of 
control  vanish  on  K^,  and  this  fact  has  the  engineering  interpretation  that 
the  inputs  to  at  least  q  'integrators'  in  the  control  loop  reduce  to  zero 
when  the  'error'  is  zero.  In  other  words,  we  have  a  version  of  the  Internal 
Model  Principle  for  nonlinear  regulation  against  step  inputs.  Constructing 
a  tubular  nbhd  of  k ( Kc )  in  X,  we  may  parametrize  it  as  usual  by  the  fibers, 
and  the  n-Yc  a q  'components'  of  the  fiber  vectors  provide  a  specific  reali¬ 
zation  of  the  required  feedback  variables. 

We  can  be  more  precise  about  yc  if  we 

Assume:  x(t,xQ)  — »  v  (V)  as  t  •*«■>,  all  xQ. 

Then  £_1(Z  )  =  v (V) 

— *  ?o1(V  nC(V  =  ^(v) 

Y0+Yc-n+q  =  ra  in  (10) 
nc  =  q  in  (13)  . 

Recall  K  :=  (£  ok)  ^(Z  ). 

c  c  c 

Then  y  =  dim  (K  ) 
c  c 

=  n-n_  =  n-q  =  dim  (K) 

c  * 

Thus  K  is  some  closed  submanifold  of  K,  having  the  same  dimension, 
c 


I  (V) 


I  (V) 
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7.  AN  INTERNAL  MODEL  PRINCIPLE  FOR  CONTROLLED  SEQUENTIAL  MACHINES 
7 . 1  Introduction 

A  long-standing  idea  in  the  control  literature,  and  in  several  other 
disciplines,  is  that  good  control,  on  the  part  of  an  organism  or  artifact, 
depends  on  the  formation  of  an  internal  model  of  the  outside  world.  We 
shall  discuss  one  way  in  which  this  'internal  model  principle'  can  be 
formalized,  in  the  context  of  controlled  sequential  machines.  In  conclusion 
we  speculate  on  the  implication  for  future  control  theories. 

First,  some  citations  from  the  literature. 

1.  O.J.M.  Smith  (1958): 

"Feedback  design  techniques  (can  be)  based  on  predictors  which  incor¬ 
porate  models  of  the  mechanism  of  generation  of  the  various  signals  being 
predicted” . 

2  .  C.R.  Kelley  (1968)  : 

"Prediction  and  planning  involve  the  operator’s  internal  model  of 
the  control  process  and  the  variables  in  the  environment  that  affect 
it  ...  Man's  ability  to  interact  with  the  environment  depends  on  the 
accuracy  of  his  internal  model  of  that  environment". 

3.  K.  Oatley  (1972) : 

"One  fruitful  idea  is  that  the  brain  is  a  complex  information¬ 
processing  device  that  contains  an  internal  model  of  the  outside 
world. . .The  brain,  in  other  words,  acts  as  a  model  in  which  neural 
processes  symbolize  the  workings  of  the  external  world  and  thus  allow 
us  to  predict  the  outcome  of  events  and  of  our  own  actions". 

(Credit  to  K.  Craik  (1967)). 

4.  James  Albus  &  John  Evans  (1976): 

"Efforts  to  equip  a  robot  with  modest  intellectual  capacities  have 
occupied  workers  in  the  field  of  artificial  intelligence  for  many  years. 
In  most  instances  the  effort  has  required  that  the  upper  levels  of  con¬ 
trol  incorporate  some  internal  data  structures  -  a  "world  model1',  or 
knowledge  frame  -  that  can  represent  the  state  of  the  environment  in 
a  meaningful  way.  The  robot  must  also  be  endowed  with  an  adequate  re¬ 
pertory  of  sensors  and  data-processing  circuits  for  analysing  the  en¬ 
vironment  so  that  the  robot  can  keep  its  internal  knowledge  frame  up  to 
date" . 


7 . 2  Abstract  Internal  Model  Principle 

System  is  abstract  automaton 
State  transition  map  a:X  *  X 
Controller  y  :X  -  W  :x  i — »  y  (x)  =  w 
W  is  controller  state  set. 


Good  set  K  c  X 

K  is  where  you  would  like  to  be. 


(Typical  situation:  X 

K  =  K  *  X  ,  K  c  X 
SC 


X  X  X 
s  c 


y:  (x  .x  ) 
s  c 


X  ) 

c 


Assume:  y (K)  =  W 

i.e.  "Knowledge  merely  that  xc  K  yields  no  information  about 
value  of  y (x)  e  W. " 

Exosystem  -  "outside  world  as  seen  by  controlled  system".  Assume 
description  incorporated  in  o,  viz.  there  is  a  subsystem  X+  <r  X, 

X*  c x,  a (X+)  c  x+. 

X+  could  be  determined  aB  follows.  Postulate  that  the  exosystem  is 
modeled  by  a  pair  <xe«ae)  that  drives  the  total  system  (X,a)  in 
accordance  with  the  diagram 


X 


X 


Here  ”  is  a  surjection.  Now  assume  that  (xe<ilg)  is  coupled  to  (X,u) 

in  such  a  way  that  the  total  'action'  due  to  (X  ,a  )  is  uniquely 

+  +  ®  ® 

determinable  as  an  induced  subsystem  (X  ,a  ),  viz.  there  exists  a 
unique  injection  v:Xe  *  X  such  that  we  have  the  diagram: 


Consider  time  history  fx(t)st«  Z  +  ), 
x  ( 0 )  =  X()  t  X 
x  ( t  +  1)  =  a  (x  ( t)  )  ,  t  >  0 . 


The  resulting  sequence  of  control  states  is 
w(t,xQ)  =  y  (x(t)  )  ,  t  <  Z+ 

We  would  like  to  recover  xQ  from 

fwIt.Xjj)  it  e  2  +  ),  for  certain  xQ  . 

For  this  we  need  concept  of  observer. 

Observers 

Let  0:X  »  T  be  an  'output'  map. 

Write  x  :  x'  mod  0«=»9(x)  =  0(x'). 

Identify  8  —  equivalence  relation  on  X. 

Let  E(X)  be  class  of  all  equivalence  relations  on  X. 

If  8>  E  (X)  ,  also  identify  8  with  projection  X  — >  X/9 . 

Also,  9  ~  partition  of  X. 

If  0,n  *  E  (X)  ,  say  r,  <  9  (n  is  finer  than  0)  if  each  equivalence  class  of 
'(  is  a  subset  of  some  equivalence  class  of  9. 

n  -•  9  x 


x/n - x/o 

i.e.  8  factors  through  n . 

Fact :  '  is  a  partial  order  on  E (X)  and  induces  a  lattice  structure. 

Namely  for  any  n,o  .  E  (X)  there  exists  4  :=  n A with  the  properties 

1  '  n  &  <  0 
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i  (ViJj)  :  i|i  <  n  and  t(i  <•  0  *«*  i(j  <  41 

Similarly  there  exists  a>  :=  n»0 

(replace  <  by  >  in  definition  of  a) 

Fact :  t ( X )  is  complete ,  i.e. 

F  '  f (X)  — ■ »  sup  f ,  inf  Ft  E  (X)  . 

Note:  0  :  =  inf  E (X)  =  { (x) :x  <  X) 

1  :  =  sup  E  (X)  =  { X > 

Can  now  define: 

Given  (0 , u)  ,  a  :X  -»  X,  0  t  E  (X)  ,  the  corresponding  observer  is  »  e  E  (X)  , 
u>  :  =  sup  { w '  1  <  E  (X)  &  «'  '  8»((u'  »a)  I 
(Here  x  x‘  mod  a i'«a  *»=>a(x)  =  a(x')  mod  o’) 

u  is  the  coarsest  partition  of  X  that  is  finer  than  0  and  is  also 
a  congruence  for  a. 

Can  show :  w  =  inf  f0oa*  ^ ,  i>l). 

Say:  (8, a)  is  observable  if  u>  =  0. 

Now  let  E  c  X,  o(E)  ’  e/  =  a  |  E  etc. 

If  a)  is  observable  for  a  then  is  observable  for  a  . 

fc/  t 

E  is  detectable  rel.  (0,ct)  if  (0E,cig)  is  observable. 

Assume:  >:+  detectable  rel.  (  y  ,  01 )  . 

Feedback 

Controller  is  actuated  externally  only  when  system  state  deviates  from 
good  set  K. 

Let  x  <  K. 

Control  state  is  y(x)  <  W. 

Successor  control  state  is  yoa(x)  >.  W. 

Assume:  y.a(x)  depends  only  on  y(x). 

viz.  (y»a)K  >  yR 


Summary  of  assumptions 


X,  a:X  -  X 

(1) 

W,  y:X  ->  W 

(2) 

K  c  x,  y  (K)  =  W 

(3) 

X+  <  K,  a (X+)  c  X+ 

(4) 

X+  detectable  rel. 

(y  ,a) 

(5) 

yK  <  ly.«)K 

(6) 

Put 

a+  :=  a |x+,  y+ 

«  rU+ 

Theorem 

(i) 

31  map  a:W  *  W  determined  by 

(a© Y ) | K  =  (y 

oa)  |  K 

(ii) 

+  +  + 

a© y  *  y  °a 

(iii) 

Y+  is  injective. 
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We  have 


X 


+ 


X 


+ 


y 


w 


ct 


Y 


+ 


W 


Proof 

( i )  Let  w  fW. 

(  3)  — ♦  3x  *  K ,  y  (x)  =  w . 

Let  a  (w)  :=  y ®a  (x)  , 

OK?  Let  x 1  t  K ,  y (x' )  =  w. 

=*  x  x '  mod  y 

x  x'  mod  Yo0t»  by  (6) 

{  YoOt)  X  =  <  Yo3)  (x  '  ) 

( i i)  Let  x  t  X+ .  By  { i ) , 

X+  K  -*  toy(x)  =  Yoa(x) 

a(x)  =  x+(x)  6.  a(x)  f  X+  >  aoY+(x)  -  Y+°a+(x). 

(iii)  Let  =  observer  for  (y,  O  . 

+  + 

:=  *>  X  ,  etc. 

(5)  -..+  =  sup  iv'swV  €  (X+)  &  oj ’  <  y  +  a  '  oct+)  ) 

=  0. 

+  +  + 

But  (n)  r  o-Ji 

So  i  *  '  ■.  +  -  0 

■f 

i.e.  t  is  injective.  QED 

7  .  3  Conclusion 

While  the  role  of  internal  models  now  appears  to  be  central  in  many  pro¬ 
cesses  of  prediction  and  regulation,  formal  demonstrations  of  the  existence 
of  an  internal  model,  and  of  how  an  internal  model  may  be  used  (together  with 
a  stabilizer)  in  a  feedback  loop  to  achieve  good  control,  are  still  few  and 
somewhat  specialized.  However,  such  facts  as  we  possess  do  point  to  the 
intriguing  possibility  of  a  'general  systems  theory'  considerably  deeper 
and  mo;e  substantial  than  exists  under  that  title  at  the  present  time. 
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MULTI  VAAI  ABIE  DESICH  :  THE  OPTIMIZATION  OP  APPROXIMATE  INVERSES 

G.  Zaraes 

Dept,  of  Electrical  Engineering,  McGill  University, 

3480  University  St.,  Montreal,  Canada  H3A  2A7 


A  new  mathematical  framework  for  the  analysis  of  sensitivity  in  multi-input-multi-output  linear  feed¬ 
back  systems  is  proposed,  based  on  the  concepts  of  a  multiplicative  seminorm  and  an  approximate  inverse. 

Many  of  the  empirical  results  of  "classical"  control  theory  pertaining  to  the  use  of  lead- lag  networks 
can  be  deduced  in  this  framework  as  solutions  to  well-posed  mathematical  optimization  problems.  Moreover, 
new  classes  of  optimal  filters  for  the  reduction  of  sensitivity  are  introduced. 

A  definition  of  optimal  sensitivity  to  plant  uncertainty  is  established.  The  multiplicative  property 
of  seminorms  is  used  to  obtain  the  following  principle:  For  any  specified  aposteriori  accuracy,  there  is 
a  maximum  of  apriori  plant  uncertainty  that  can  be  tolerated,  and  a  minimum  of  identification  that  is  re¬ 
quired. 

INTRODUCTION 

In  this  expository  sumnary  of  Zames  (1976-81)  we  are  concerned  with  the  effects  of  feedback  on  uncer¬ 
tainty,  where  uncertainty  occurs  either  in  the  form  of  an  additive  disturbance  d  at  the  output  of  a  linear 
plant  F. 


d 


Fig.  1 

or  an  additive  perturbation  in  P  representing  "plant  uncertainty".  We  approach  this  subject  from  the  point 
of  view  of  classical  sensitivity  theory,  with  the  difference  that  feedbacks  will  not  only  reduce  but  actually 
optimize  sensitivity  in  an  appropriate  sense. 

The  theory  is  developed  at  two  levels  of  generality.  At  the  higher  level,  a  framework  is  sought  in  which 
the  essence  of  the  classical  ideas  can  be  captured.  To  this  end,  systems  are  represented  by  mappings  be¬ 
longing  to  a  normed  algebra.  The  object  here  is  to  obtain  general  answers  to  such  questions  as:  How  does 
the  usefulness  of  feedback  depend  on  plant  invertibility?  Are  there  measures  of  sensitivity  to  plant-uncer¬ 
tainty  that  are  natural  for  optimization?  How  does  plant  uncertainty  affect  the  possibility  of  designing 
a  feedback  scheme  to  reduce  plant  uncertainty? 

At  a  more  practical  level,  the  theory  is  illustrated  by  simple  examples  of  involving  single-variable  and 
multivariable  frequency  responses.  The  questions  here  are:  Can  the  classical  "lead-lag"  controllers  be  de¬ 
rived  from  an  optimization  problem?  How  do  rnp  (right-hal f-plane)  zeros  restrict  sensitivity?  In  multi - 
variable  systems  without  rhp  zeros,  can  sensitivity  be  made  arbitrarily  small,  and  if  so  how? 

A  few  observations  might  serve  to  motivate  this  re-examination  of  feedback  theory.  One  way  of  attenu¬ 
ating  disturbances  is  to  introduce  a  filter  of  the  W.H.K.  (Wiener-Hopf-Kalman)  type  in  the  feedback  path. 
Despite  the  unquestioned  success  of  the  W.H.K.  and  state-space  approaches,  the  classical  methods,  which  rely 
on  lead-lag  "compensators"  to  reduce  sensitivity,  have  continued  to  dominate  many  areas  of  design.  On  and 
off  there  have  been  attempts  to  develop  analogous  methods  for  multivariable  systems.  However,  the  classical 
techniques  have  been  difficult  to  pin  down  in  a  mathematical  theory,  partly  because  the  purpose  of  compensa¬ 
tion  has  not  been  clearly  stated.  One  of  our  objectives  is  to  formulate  the  compensation  problem  as  the  so¬ 
lution  to  a  well-defined  optimization  problem. 

Another  motivating  factor  is  the  gradual  realization  that  classical  theory  is  not  just  an  old-fashioned 
way  of  doing  WHK ,  but  is  concerned  ith  a  different  category  of  mathematical  problems.  In  a  typical  WKH 
problem,  the  quadratic  norm  of  the  response^to  a  disturbance  d  is  minimized  by  a  projection  method;  in  a 
deterministic  version,  the  power  spectrum  |d(jw)|  is  a  single,  known  vector  in,  e.g.,  the  space  L2  (-00,*) » 
in  stochastic  versions,  d  belongs  to  a  single  ramdom  process  of  known  covariance  properties.  However,  there 
are  many  practical  problems  in  which  d  is  unknown  but  belongs  to  a  prescribed  set  of  disturbances,  or  to  a 
class  of  random  processes  whose  covariances  are  uncertain  but  belong  to  a  prescribed  set.  For  example,  in 
audio  design,  d  is  often  one  of  a  set  of  narrow-band  signals  in  the  20-20K  Hertz  interval,  as  opposed  to  a 
single,  wide-band  signal  in  the  same  interval.  Problems  involving  such  disturbance  sets  are  not  tractable 
by  W.H.K.  or  projection  techniques.  One  objective  here  is  to  find  a  systematic  approach  to  problems  in¬ 
volving  such  sets  of  disturbances. 

Another  observation  is  that  many  problems  of  plant  uncertainty  can  be  stated  easily  in  the  classical 
theory,  e.g.  in  terms  of  a  tolerance -band  on  a  frequency  response,  but  are  difficult  to  express  in  a  linear- 
quadratic-state-space  framework.  One  reason  for  this  is  that  frequency-response  descriptions  and,  more 
generally,  input-output  descriptions  preserve  the  operations  of  system  addition  and  multiplication,  whereas 
state-space  descriptions  do  not.  Another  reason  is  that  the  quadratic  norm  is  hard  to  estimate  for  system 
products,  whereas  the  induced  norm  (or  "gain")  that  is  implicit  in  the  classical  theory  is  easier  to  esti¬ 
mate.  We  would  like  to  exploit  these  advantages  in  the  study  of  plant  uncertainty. 


Weighted  Seminorms  and  Approximate  Inverses 


One  way  of  defining  the  optimal  sensitivity  of  a  feedback  system,  and  of  addressing  these  issues  is  in 
terms  of  an  induced  norm  of  the  sensitivity  operator.  However,  it  is  shown  Zames  (1979)  that  the  primary 
norm  of  an  operator  in  a  normed  algebra  is  useless  for  this  purpose.  Perhaps  that  is  why  operator  norm 
optimization  has  not  been  pursued  extensively  in  the  past. 

Instead,  we  introduce  an  auxiliary  “weighted"  seminorm,  which  retains  some  of  the  multiplicative  pro¬ 
perties  of  the  induced  norm,  but  is  amenable  to  optimization.  Plant  uncertainty  is  described  in  terms  of 
belonging  to  a  sphere  in  the  weighted  seminorm. 

Approximate  invertibility  of  the  plant  is  one  of  the  features  which  distinguishes  control  from,  say, 
communication  problems.  We  define  the  concept  of  an  approximate  inverse  under  a  weighted  seminorm,  and  show 
that  sensitivity  reduction  is  possible  if  and  only  if  there  is  such  an  inverse. 


ALGEBRAS  OF  SYSTEMS 

A  feature  of  the  input-output  approach  is  that  systems  can  be  added,  multiplied  by  each  other  or  by 
scalars,  and  the  sums  or  products  obtained  are  still  systems;  i.e.,  causal  input-output  mappings  from  an 
algebra.  The  algebra  of  all  such  mappings  under  consideration  will  be  denoted  by  IA.  For  example,  Tk  can  be 
an  algebra  of  (possibly  multivariable)  causal  frequency  response  mappings.  I  denotes  the  identify  in  3A. 

Suppose  that  IA  denotes  the  strictly  causal  systems  in JA,  i.e.,  JAS  is  a  subalgebra  of  JA  defined  by  the 
property:  for  any  plant  p€;ftg  and  feedback  F^IA,  the  c.l.  (closed-loop)  operators  (I  +  FP)~*  and  (I  +  PF)”1 
are  well  defined.  Strict  causality  is  a  generalization  of  the  physical  property  that  (strictly  proper) 
systems  can  not  respond  instantly  to  sudden  inputs.  Mathematically,  JAt,  is  a  radical  of  IA.  However,  all  that 
concerns  us  here  is  that  the  incerses  shown  exist. 

The  subalgebra  of  stable  systems  in  will  be  denoted  by  B.  Suppose  that  a  norm,  II  «ll  ,  is  defined  on  B 
and  that  B  is  complete.  An  example  of  such  a  normed  algebra  is  the  (Hardy)  space  H°°  of  frequency  response 
£>(s)  analytic  and  bounded  in  the  rhp  Re(s)  >0,  where  the  norm  is  the  supremum  of  jp(s)|  over  the  rhp.  The 
symbol  Bs  will  denote  the  subalgebra  of  strictly  causal  systems  of  B. 

COMPENSATOR  PARAMETRIZATION 


A  feedback  system  with  strictly  causal  plant  P  and  feedback  F  in  JAg 
of  4  c.l.  operators  _ 


(I  +  FP) 


F(I  +  PF) 


-1 


-1 


P(I  +  PF) 
-1 

(I  +  PF) 


can  be  represented  by  the  matrix 


(1) 


(I  +  PF)  is  the  c.l.  response  to  a  disturbance  at  the  plant  output,  and  will  be  called  the  sensitivity 

operator.  The  initial  objective  is  to  make  this  sensitivity  small  in  some  appropriate  sense,  while  keeping 
all  4  operators  stable.  Our  first  step  will  be  to  separate  the  desensitization  problem  from  stabilization 
by  a  change  of  variable. 


Initially,  assume  P  stable.  Let  Q  4  F(I  +  PF) 


The  matrix  (1)  can  be  expressed  as 


(I  -  QP) 


P(I  -  QP) 


Q(I  -  QP) 


(I  -  PQ) 


(2) 


It  can  be  shown  that  (1-2)  are  stable  if  and  only  if  Q  is  stable.  In  c-fect,  Q^B  is  a  parametrization  of 
all  feedbacks  that  preserve  c.l.  stability.  By  designing  Q  instead  of  F  we  replace  a  potentially  unstable 
operator  by  a  stable  one  and,  more  importantly,  automatically  guarantee  closed- loop  stability;  no  need  to 
keep  checking  Nyquist's  criterion,  etc.  The  expression  for  sensitivity  assumes  the  from  (I  -  PQ) ,  which 
will  be  convenient  in  studying  the  relation  between  sensitivity  and  invertibility  of  P. 


If  a  plant  PQ  is  unstable  but  is  stabilized  by  a  feedback  F0^BS,  which  makes  (I  +  PQ  FQ)  stable, 
then  design  can  be  separated  into  2  stages:  '1)  stabilization  by  FQ;  (2)  desensitization  by  Q.  In  other 
words,  the  pair  of  operators  (F0,Q)  parametrizes  the  set  of  all  stabilizing  compensators,  in  a  manner  which 
separates  design  for  say,  low  sensitivity,  from  stabilization. 


LIMITATIONS  OF  NORMS 


For  low  sensitivity,  Q  must  be  designed  to  make  (I  -  PQ)  small.  If  P  had  a  right  inverse,  the  choice 
P  =  Q*"1  would  make  sensitivity  identically  zero.  However,  it  can  be  shown  that  strictly  causal  operators 
have  no  exact  inverses  in  B  (just  as  strictly  proper  frequency  responses  have  no  strictly  proper  inverses). 
Instead  we  try  to  make  Q  an  approximate  right  inverse  of  P,  by  making  (I  -  PQ)  small  in  some  sense.  Indeed, 
the  various  approaches  to  feedback  theory  can  be  classified  according  to  the  way  in  which  they  measure  (I-PQ) 
and  construct  this  approximate  inverse.  At  this  point  we  note  two  peculiarities  (Zames, 1979) . 

(a)  III  -  PQll  >1,  i.e.,  it  is  impossible  to  make  sensitivity  small  in  the  primary  norm  of  a  Banach  algebra. 

Ex.,  for  frequency  responses,  £{ju>)Q(jw)  +  0  as  |ajj  •*■»,  so  sensitivity  approaches  1.  It  is  possible  to  make 

ill  -  PQl!  small  if  (I  -  PQ)  is  restricted  to  a  finite  band  of  frequencies,  or,  more  generally,  an  invariant 

subspace  of  B.  However, 

(b)  the  infimum  over  all  compensators  Q^B  of  (I  -  PQ)  restricted  to  an  invariant  subspace  of  B  is  always 
either  zero  or  one.  For  non-minimum  phase  systems,  it  can  be  shown  that  if  sensitivity  is  made  to  approach 
zero  over  one  subspace  by  choice  of  Q,  then  it  approaches  infinity  over  the  complementary  subspace.  The 
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norm  of  (I  -  PQ)  over  an  invariant  subspace  is  therefore  not  a  useful  measure  of  sensitivity  for  optimization 
purposes . 

Properties  (a-b)  delimit  the  peculiarities  of  the  sensitivity  reduction  problem.  In  one  form  or  another 
they  were  recognized  in  classical  theory/  and  are  perhaps  the  reason  why  sensitivity  reduction  was  not  posed 
as  an  optimization  problem. 


Our  approach  will  be  to  employ  an  auxiliary  norm  or,  more  generally,  semi norm  (a  nonzero  element  has 
nonzero  norm,  but  may  have  zero  serai norm)  relative  to  which  (I  -  PQ)  can  be  made  small. 

WEIGHTED  SEMI NORMS 

(We  shall  consider  symmetric  seminorms  here.  In  the  general  theory,  left  and  right  seminorms  are 
employed. ) 

Suppose  that  an  auxiliary  seminorm  il*llw  is  defined  on  B,  which  will  be  called  a  weighted  seminorm,  and 
has  the  property  that  !IpIIw<IIpII  for  all  pCB,  i.e.,  I!  -II  "dominates”  Let  II  *||w  be  held  fixed  through¬ 

out.  Q  is  called  an  approximate  right  inverse  of  P  €  B  relative  to  IHIW  if  III  -  PQ||  <!|l||  The  mea¬ 
sure  of  right  singularity  of  P  is  W  w 


u(p>  4  infQ  Gb  Hi  -  pqM w 

s 

p(P)  is  a  number  between  0  and  1  which  measures  the  noninvertibility  of  P  (and  which,  for  frequency  responses 
p(jw),  usually  depends  on  the  rhp  zero  locations  of  £.) 

THEOREM  For  any  plant  P  B,  there  is  a  sequence  (of  compensators)  Fn  €  ^  ,  n  *  1,2,...,  for  which  the 
sensitivities  11(1  +  PFn)-1fl  approach  u(P),  but  no  compensator  can  give  a  sensitivity  less  than  u(P). 

i.e.,  the  optimal  sensitivity  coincides  with  the  measure  of  singularity.  Feedback  can  reduce  sensitivity 
iff  P  has  an  approximate  right  inverse. 

In  our  framework,  a  disturbance  attenuation  filter  of  the  WHK  type  can  be  described  as  follows:  If 
the  disturbance  has  a  power  spectrum  6(ju>),  the  weighted  seminorm  is  the  root-integral-squared, 

II  (I  -  PQ)II  4  (/“  |  u-$(ju)4(ju)i  s<3-.))2  auj> 1/2 

L2 


and  y(P)  is  the  irreducible  error. 


Such  quadratic  norms  present  a  difficulty  in  problems  involving  plant  uncertainty,  because  they  lack 
the  multiplicative  property  IIabII  <11  All  -II  Bll  (needed  to  make  B  into  a  "normed-algebra”) .  If  AP  is  a  plant 
perturbation  resulting  from,  say,  the  identification  of  P,  the  closed-loop  perturbation  has  the  product  form 
(1  -  PQ) AP,  and  is  difficult  to  estimate  without  a  multiplicative  property.  Indeed,  the  product  can  become 
small  even  as  its  factors  become  large.  Instead,  we  shall  concentrate  on 


Multiplicative  Seminorms 

A  weighted  seminorm  II 'll w  on  B  is  called  multiplicative  (symmetric)  iff  it  satisfies  the  following  two 
inequalities  for  all  A,  B  €b. 


IIabII  <  II All  -II ell  ,  IIabII  <  II All  IIbII 

w  w  w 

A  simple  example  of  a  multiplicative  seminorm  on  the  space  H  of  frequency  response  is  obtained  by 
taking  a  strictly  proper  weigtiting  filter  ft  t  H  with  |w(s)  |  si  in  Re(s)»  0,  and  letting 


IIpIIw  4  max|4(jui)4(  ju>)  | 


(3) 


If  p(s)  has  rhp  zeros  Zj,  then  it  is  shown  in  Zames  (1979) 
impossible  to  achieve  small  sensitivity  if  zeros  are  present  in 


that  u(P)>G(z  ^) ,  i  =  1,2,...,  so  it  is 
any  heavily  weighted  part  of  the  rhp. 


Minimax  Filters 


T 


t 


For  seminorms  of  the  form  (3),  the  optimization  problem  takes  the  form 
inf  sup  |  11  -  4( jai)^(jui)  Iw(jai)  | 

Q^B  (A) 

s 

This  problem  has  explicit  solutions  which  are  detailed  elsewhere.  Let  us  merely  give  an  example. 
Suppose  that  the  plant  £(s)4<B-s) (8+s)"1  has  one  rhp  zero,  and  G(s)  is  the  "lowpass  11  weighting  k(s+k)~l 
k  >  0.  Then  u(P)  *  w(8),  and  an  optimal  sequence  of  filters  has  the  form  (^(s)  “  Cm(s+B)  (s+m) (s+r^) 
where  m,  n,n  =  1,2,...,  are  integers  and  ctn4m2nm  (k+8)”1  is  a  constant.  The  filters  3m(s)  are  "lead"  net¬ 
works  whose  bandwidth  incerases  with  m,  and  which  approach  a  "proportional  plus  derivative"  form  as  m  +  ®. 

Our  point  here  is  not  that  minimax  norms  are  better  measures  of  cost  than  quadratic  norms,  but  that 
minimax  norms  have  the  following  advantages:  (1)  their  multiplicative  property  makes  them  amenable  to 
studies  of  plant  uncertainty;  (2)  they  enable  us  to  design  filters  which  perform  well  over  sets*  of  distur¬ 
bances  of  imprecisely  known  power  spectra;  (3)  the  resulting  optimal  filters  have  a  remarkable  resemblance 
to  the  lead- lag  filters  of  classical  control  theory. 


•There  is  a  1:1  correspondence  between  weighted  seminorms  and  convex,  balanced,  absorbing  sets  of  functions 
in  a  Banach  space. 
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MUTLI VARIABLE  SYSTEMS 

Our  objectives  here  are:  firstly,  to  show  again  that  optimal  sensitivity  is  limited  by  the  presence  of 
heavily  weighted  rhp  zeros;  secondly,  to  show  that  sensitivity  can  be  made  arbitrarily  small,  by  a  sequence 
of  filters  of  increasing  bandwidth  for  which  formulas  are  given,  provided  the  plant  matrix  determinant  does 

not  vanish  in  the  rhp,  and  does  not  decrease  too  quickly  at  high  frequencies. 

H°°N  denotes  the  algebra  of  NxN  matrices  whose  entries  lie  in  H".  The  norm  of  any  Pi{«)  is  the 

largest  singular  value  of  P(  ju>) ,  denoted  by  sup  a[P(ju)].  (i.e..  largest  eigenvalue  of  P(  jw) *P ( ja>) ) .  The 
algebra  of  systems  with  H  frequency  responses  is  denoted  by  K°“”. 

A  fixed,  strictly  proper,  scalar  weighting  filter  w(*)  e  h”  is  assumed  to  be  given,  and  the  multiplica¬ 
tive  semi  norm  II  Pil  w  *  sup  o  [$(  ju>)P(  jw)  J  is  defined. 

We  again  conclude  that  the  measure  of  singularity  satisfies  y(P)>w(s  },  s  being  any  rhp  zero  of 
detP(s).  °  ° 

THEOREM  If  det  £(s)  ¥  0  for  Re(s)  >0,  and  |o  [  P(s)  ]  |>  c  |  s  |  k  for  |s|  >  p  ,  where  c  >0,  p  >  0,  and  k  *  1,2, ... 

are  constants,  the  sensitivity  11(1  +  PF)"^II  can  be  made  smaller  than  any  e  >0  by  the  compensator. 

Q^(s)  *  P_1(s)  ln(s+n) ”1] ,n  being  a  sufficiently  large  integer. 

DISTURBANCE  ATTENUATION  UNDER  PLANT  UNCERTAINTY 

Two  opposing  tendencies  can  be  found  in  most  feedback  systems.  On  the  one  hand,  to  the  extent  that 
feedback  reduces  sensitivity  it  reduces  the  need  for  plant  identification.  On  the  other  hand,  the  less  in¬ 
formation  is  available  about  the  plant,  the  less  possible  it  is  to  select  a  feedback  to  reduce  sensitivity. 
The  balance  between  these  tendencies  establishes  a  maximum  to  the  amount  of  tolerable  plant  uncertainty  and, 
equivalently,  a  minimum  to  the  amount  of  identification  needed. 

It  can  be  argued  that  the  search  for  such  a  minimum  should  be  basic  to  the  theory  of  adaptive  systems. 
Actually,  even  the  existence  of  such  a  minimum  appears  not  to  have  been  stated,  perhaps  because  plant  un¬ 
certainty  is  so  difficult  to  study  in  the  WHK  framework  in  the  absence  of  the  multiplicative  properties,  and 
because  there  is  no  notion  of  optimality  in  the  classical  setup. 

Here,  we  would  like  to  take  a  step  in  the  direction  of  articulating  these  issues,  by  defining  the  trade¬ 
off  between  minimal  sensitivity  and  plant  uncertainty  and  deducing  its  simpler  properties.  Sensitivity  to 
disturbances  will  be  considered  in  this  section,  and  to  plant  uncertainty  in  the  next. 

Uncertainty  in  a  plant  P  e  Bs  can  be  described  by  stating  P  is  an  unknown  element  of  some  prescribed 
"ball"  of  plants,  denoted  by  b(P^,6),  centred  at  some  nominal  plant  Pj,  and  of  radius  6  >  0,  i.e.,  Hp^-pII^S. 
For  example,  for  a  frequency-response  £(jw)  this  means  that  P(jw)  lies  in  a  band  of  radius  6  centred  around 
P1(ju)) .  This  description  of  uncertainty  may  be  cruder  than  a  probabilistic  one,  but  has  the  pragmatic  advan¬ 
tage  of  being  tractable  under  the  nonlinear  feedback  transformation. 

Let  IMIW  he  a  given  multiplicative  seminorm.  If  the  plant  is  in  the  ball  b(P^,6),  we  define  the  optimal 
sensitivity  under  plant  uncertainty  to  be 

n(P.,6)  =  inf  sup  11(1  ♦  PF)  *11 
F  p€b(P1,6) 

the  inf  being  over  all  F  £  *s  that  maintain  c.l.  stability;  i.e.,  n  is  the  optimized  worst-case  sensitivity. 
We  obtain  the  following  result. 

THEOREM  For  any  nominal  plant  P^  G  Bg  and  radius  of  uncertainty  5  >  0,  the  optimal  sensitivity  n(P^,6)  is 
a  monotone  nondecreasing  function  of  6,  which  approaches  the  singularity  measure  y(Pi)  as  6  -►0,  and  equals 
II  III w  for  6  *  NpiNw* 

In  other  words,  the  less  information  we  have  about  P,  the  less  we  are  able  to  design  a  feedback  to 
attenuate  disturbances. 
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FILTERING  OF  PLANT  UNCERTAINTY:  INVARIANT  SCHEMES 

Finally,  we  turn  to  the  problem  of  a  plant  P  in  a  ball  b(P^,6)  of  uncertainty  of  radius  &  >  0  centred 
at  a  nominal  value  P^,  and  consider  feedbacks  that  shrink  the  uncertainty.  Any  linear  feedback  arrangement 
for  a  plant  that  has  only  one  (possibly  multivariable)  accessible  input  can  be  parameterized  by  two  opera- 
tors  (F,  R)  as  shown  in  the  following  flow-graph. 


Fig.  2 

Of  course,  uncertainty  can  be  reduced  to  zero  by  disconnecting  the  input  u  from  the  system,  but  then 
Pi,  is  also  transformed  into  zero.  Clearly,  the  problem  is  trivial  unless  there  is  a  normalization  or  con¬ 
straint  on  the  control  law  that  transforms  Pi  into  a  closed- loop  system. 

We  would  prefer  as  far  as  possible  to  separate  the  reduction  of  uncertainty  from  the  transformation  of 
P^,  and  therefore  seek  a  definition  of  uncertainty  which  is  independent  of  the  eventual  closed- loop  system. 

If  Pi  were  a  real  number,  uncertainty  could  be  normalized  by  specifying  it  as  a  percentage  of  the  nomi¬ 
nal  value.  This  possibility  is  not  open  for  noninvertible  plaints.  Instead,  we  achieve  a  normalized  defi¬ 
nition  of  uncertainty  by  employing  the  device  of  a  "plant-invariant  scheme":  i.e.,  a  scheme  which  leaves 
the  nominal  plant  invariant  but  shrinks  the  radius  of  uncertainty.  Such  a  scheme  is  shown  to  be  always 
realizable  in  the  form  of  the  following  "model  reference  scheme": 


d 


Fig.  3 

This  device  also  enables  us  to  separate  the  design  process  into  two  consecutive  stages:  (1)  reduction 
of  uncertainty,  and  (2)  transformation  of  the  nominal  plant  into  a  nominal  c.l.  system.  (cf.  the  separation 
into  »stimation  and  control  stages  in  Kalman  filtering.) 

Suppose,  then,  that  the  feedback  scheme  is  (normalized  by  being)  plant  invariant,  and  for  simplicity 
here  say  that  R  =  0.  Let  If  *llw  be  a  given  multiplicative  seminorm. 

For  a  plant  P  in  a  ball  of  uncertainty  b(P^,6)  the  optimal  sensitivity  to  plant  perturbations  is  defined 

to  be 

v(p  ,6)  ■=  T  inf  sup  l|P(I  ♦  FPl"1  -  P,(I  +  FP,)'1|I 

1  6  F  pEblP^!)  1  1  W 

THEOREM  For  any  nominal  plant  Pp  in  Bs,  the  infimal  sensitivity  to  plant  perturbations  v(Pp,5)  is  a  mono¬ 
tone  nondecreasing  function  of  6  >  0.  v(Plf5)  satisfies  the  upper  bound  conditions  u(P1,6)  <  nlP^.S)  and 
lim  v(P. ,6)  <  u(PL>  . 

6-K) 

Again,  there  is  a  tradeoff  between  uncertainty  about  the  plant  and  sensitivity  reduction  achievable  by 
feedback. 
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FEEDBACK  AND  MINlMAX  SENSITIVITY 


*  *  • 

George  Zames  and  bruie  A.  Francis 


♦Department  of  Electrical  Engi neer ing. 
Street,  Montreal,  P.o.  H3A  2A7  CANADA 
Engineering,  Ifni vor.sity  of  Waterloo, 

ABSTRACT 

In  this  [  d|  "i  ,  we  look  for  feedbacks  that  mini¬ 
mize  the*  sensitivity  function  of  a  linear,  singlo- 
v.iriai  le  feedback  system  represented  ;.y  its  fre¬ 
quency  responses.  Sensitivity  is  measured  in  a 
weighted  II *  norm. 

In  an  earlier  paper,  Zames  proposed  an  apj  roach 
to  f.*.*dback  design  involving  the  measurement.  «>f 
sensitivity  by  "multiplicative  seminorms" ,  which 
have  cett.uin  advantages  over  the*  widely  used  quad¬ 
rat  ic  ik  rn  m  i  roblems  where  there  is  plant  unor- 
tainty,  or  where  signal  lower- spectra  are  not  fixed 
cut  b“lon,.j  to  s*  ts .  The  {  roblem  was  studied  in  a 
general  setting,  and  soir-  IP  examj  bs  w**r>-  solv  i. 

h* •  r** ,  a  detailed  study  of  th**  single- variab  b» 
ca  •  *  is  undertaken.  The  results  arc  extended  to 
u:i"t  » ;  1  •  ;  lants,  and  explicit  formulas  f-r  the  n- 
•  •tu;  ltuition  of  a  finite  number  of  RHP  plant  .•  i*-. 
or  ;>i.  >  ar**  provided.  The  *,?  or  "appr  ix in.tL'  -in  - 
v-  r:;e"  |  ,.ir  »m>*t.  r  i /.at  ion  <f  feedbacks  that  maintain 
closed-  loop  stability  is  extended  to  th'?  case  <>f 
uxrd  a:  1*  ;  lants.  The  M  and  Wiener-Mopf  afproaehesi 

ar*  corns ared. 


1.  INTRO DUCT  ION 


'This  is  a  .summary,  with  proofs  omitted,  of  a 
report  obtainable  from  th«>  authors.) 

In  t  hi-  |  i  j  e  r  w.  study  the*  linear  servo  prob¬ 
lem,  and  -.*  “k  a  feedback  F  (see  Fi*j.  1)  to  minimize 
the  sensitivity  to  output  disturbances  d,  at  the 
output  y  of  >i  linear  singb*-vai  table  plant.  P  charac¬ 
terized  1/  its  fr*  pien  -y  r«*r,jrfni-.** .  Sensit  ivity  is 
measured  m  a  weigr.ted  M  (operator)  norm. 

■;:;•*  re  ar'*  sevi.tl  reasons  for  our  interest,  in 
H ''  m f  t  ion. 

1)  Sr'M.lvm-.  involving  variable  power  spectra 
ar**  not  usually  tractable  Ijy  quadrat ic  opt  i  mizat  ion 
methods,  but  ar**  tractable  by  H°  method;?. 

2)  H  methods  yield  filters  with  the  "broadband" 
quality  t.hat  often  distinguish**:;  r-lassical  frequency 
r**:ij  1 1  * ;  <  •  from  Lr)G  <l<v.  wns,  and  offer  t  h'*  possibility 
oi  .»  b  arer  mat.hemat  ic  al  basis  for  such  udhoo  clas- 

-i  .il  |  rs-  e<|ur>>;  as  lead- lag  t  otnj  etisat  ioti 


McGill  I’iuv't  i  f  y ,  i>JHc  e  .j  v,.r  ;Jty 
* ‘Department  of  Klectri  al 
Waterloo,  r.nr,irm,  M21.  b;l  'ANAPA 

3)  Problems  involving  i  1  ini  perturbations  or 
uncertainty  are  most  naturally  studied  under  norms 
with  multiplicative  properties  of  ty-  t_yj  *• 
lib'll  f  II  r|l  *|lc>ll  ,  which  If’  norm  .  h.ive  but  quadrat  i. 
norms  do  not,  and  whicn  allow  the  norm  of  a  ;  roduct 
of  *■  ratty,  for  functions  t->  r ■  ]  i»H  to  the  n  rms  of 
the  factors. 

Tn  Zunes  [1]  some  of  *  mj:,  fe.ifur*-  d  ..,-jr 
api  roach  were  outlin'  d  in  i  :•  *,<  r  1 1  .•  »*  :r  i,  md 

examples  were  solv!  i  nv*  1  v:r:  :  r  i ;  •  ;  var  ;  »:  i*  ;y 

•■•■ms  with-  ut  HIP  (riTb*  » !  f  -  -  )  ...  r  ,  ,r-j  •  in- 

•Jle  viriai  !•  ny-ters  a;.  ,  p.g  r-f,  *i»n 

■■mphasi:;  on  ;lant  ;  *  ■ .  i  -  ,*  •  ;  r  ■  iv  :-  n 

rftal  iliZ'  d.  Her*  .  w  • •  |.  j  j  *  i  iv  >f 

t  l.e  i  ngb*- van  .*•  1.  .i  .  .•  »•.  ,•  .•  ,<f  :  ;  i.*.t 

un'-.-rt  .i  1 1;‘  .  ,  ex**-:,  i  ♦  *  i  i.*  ;  i  u,r  ,t 

and  provide  •xt  U"it  t  i-b  ?  r  :•  :•  r  « :  c  .»•.*• 

of  a  f  i  ri :  i  *  nun:.'*  r  of  l-i,:  -  It:.*  •  r  r  :*jb  >. 

It  i  1  ]  *  :.*  r.  •  1;  a  .  •  .  ,*  :r..i  i  r.t  » :  n  1 

stability  were  j  a  r  ir>  r  i  .*•  1  :  j  *  *:  b  ys*  ■  n-vm- 

ible  o,  wtiichi  w  is  i.  *•  r;  t  ■  *  ,  ,  u-  J:  .  r  xiinat..  in¬ 
verse.  Hire,  we  *  xt**nd  •  ..  :  »r  ir-  t.  r  izat  imi  to  un¬ 

stable  «  1  ant  ,  and  inter;  r-*t  .  as  a  onstrain**d 
approximate  inverse. 

The  H  and  Wiener-Mo;  f  af f  roaches  are  conpared 
lu  an  examp b-*. 

Our  results  are  an  application  of  the  theory  of 
optimal  interpolation  devloj-ed  early  in  the  century 
by  wirtheodory,  Sehur.  ri>k,  N«  vanlinna  and  others 
(see  Walsh  [2,  Chapter  !••)).  An  exposition  of  op¬ 
timal  II  interpolation,  involving  the  funct. ional- 
analy'ic  concept  of  duality,  was  develop'd  several 
decades  later  by  Rogos inski  and  Fhapiro  [3],  and  is 
outlin«?d  in  Duron  (4). 

The  g-parametrizatior.  is  related  to  the  well 
known  results  of  Youla-Bongiorno-dabr  [(>1  ,  and  the 
interesting  generalizations  recently  obtained  by 
Desoer-Li u- Mur ray-. Sacks  |7J,  for  roj rime  factorable 
plants  and  feedbacks.  The  Q-j  aramet. r i zat i«>n  docs 
not  preassume  coprim#*ncss.  it  is  convenient  here 
because  it  is  affine  in  Q,  and  displays  the  rela¬ 
tionship  between  sensitivity  and  invert i bi 1 lty. 

Safonov- At bans  [8}  and  Stein- Doyle  I'M  have 
employed  maximum  sensitivity  constraints  in  quad¬ 
ratic  optimization  problems,  and  our  motivation 
here  overlaps  theirs. 
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1 1 .  PROBLEM  PORMULAT ION 


The  feedback  system  of  Fig.  1  consists  of  a 
plant  and  feedback  "compensator",  whose  input-output 
behavior  is  represented  by  the  frequency  responses 
PI:;)  and  F(s)  respectively.  The  CL  (dosed- loop)  be¬ 
havior  of  tins  system  is  represented  by  2><2  matrix 
of  frequency  rosj»onsos 


IK.  ]“  .  ,  A 
il  1  ,  1  =1 


(1  t  FI')  1  I’(l  •  FI')"' 


III  I  PI')’1  (1  +  FF)"1 


(2.1) 


In  particular,  the  response  of  the  output  y  to  a 
d i  t  ui banco  d  is  K??  -  (l+FP)'1.  This  term  appears 
li.  mist  expressions  for  sensitivity,  whether  to  add¬ 
itive  disturbances  such  as  cl,  or  to  perturbations  in 
th-  i  l.mt  P.  We  shall  rail  K->2  the  sen  s  i  Li  v i t  y  f  u  nc  - 
our  objective  will  bo  to  optimize  certain 
measures  of  this  sensitivity  function  while  keeping 

the  K . .  stable, 
i  ) 

A  f r-  'pi»»r>  ‘y  response,  for  our  purposes,  is  any 
function  d(s7  of  a  complex  variable  s  A  d  *  j<u,  which 
is  rt«: r*  imorj  hi*:  in  the  open  WIP(right  half-plane) 

Re(s)  '<>,  i.e.,  is  analytic  in  He(s)  >1),  except,  per- 
hat  s,  at.  a  countable  number  of  poles. 


A  frequency  res{xjn«;e  G(s)  is  proper  iff  the 
licit.  1  im^  sup  ,  t;  .  >(J  pits)  j  is  finite,  and 

st r i  t  ly  ;  roper  iff  th*’  limit  is  zero.  G(s)  is 
st’.1!'1  iff  it  is  bound'Hi,  i.e.,  |r»(  s)  1  ff*onst . ,  and 
analytic  in  Pe(s)  > 0. 


If  r,j  and  l">2  ar<!  stable  frequency  responses 
and  c  is  anv  Comdex  constant,  then  the  sum  Gj+G^ 
and  i  relucts  ' j  > ;  >  u,  1  cG,  ap-  all  stable  frequency 
»■ .  :  -  s  -,  i.e.,  •-.table  frequency  res;  «>iis«*s  form  an 

a  1  pda  a  «>v»r  th*-  fi**ld  of  complex  numbers.  The  norm 
,.my  ..ruble  frequency  re  upon  ;»»  G  is  defined  to  be 
II  n'l  -  ;u:  [g(s)I.  Under  this  norm,  the  alge- 

R* ■ ( s )  *  <> 

bra  of  stable  frequency  responses  is  called  the  Hardy 
of  order  ”,  and  denoted  by  II'”  .  It  can  be 
shown  that  tie--  H‘"  norm  can  l>e  found  from  the  magni¬ 
tude  G(s)  along  the  jw-axis,  i.e.,  II  till  -  sup  |G(  ja>)  |. 

1  •*> 

The  strictly  proj-or  frequency  responses  m  «  form 
a  subalqebra  denoted  by  li  . 

It  will  be  assumed  throughout  that  P(s)  and 
p  f  ■ ; )  af  f  requs-ncy  responses  such  that  P  (s)  f’(s)  is 
not"  ident icia 1 ly  equal  to  - 1  (which  will  certainly 
he  true  if  I-(s)  and  F(s)  are  strictly  proper).  This 
assumjtion  ensures  that  all  the  inverse*;  in  (2.  Dare 
wi- 1 1-def ined  frequency  responses.  Furthermore,  it 
w i  11  be  assumed  that  P(s)  is  the  sum  of  an  Ht>>  func- 
l i.M  and  a  proper  rational  function  (so  that  P(s)  is 
f  f  *  r)  .4ftd  Mia t  Pin)  has  at ” mo «J t  a  finite  number  of 
i  ■  .•  1  ■  r  and  zeros  in  Ho  (s)  0.  " 


,■  A  .  Kr  .jet  i  •  <nri  1  Ti  ati -.format  ion 

W*  snail  employ  a  t  rate;  format  i  on  which  sorvos 
t-j  |  «ir«im»*t  ri  z«*  the  feedbacks  that  keep  th**  CL  system 
'■  t  ibb  -,  i rid  thereby  ensures  a  stable  design.  This 
tr-ur  format  i'»fi  w  e  introduced  in  |1J  for  stable 
;  lints,  arid  is  extended  here  to  unstable  plants.  U’t 


Q  l*e  a  new  variable  defined  by  the  fraciotnal  trans¬ 
formation 

o  =  F(1  I  PF)"1  (2.2a) 

The  hypothesis  that  p(s)F(s)  $  -]  ensures  that  (2.2a) 
is  well-defined.  Moreover,  P(s)Q(s)  $  1,  and  F(s) 
can  be  recovered  from  Q(s)  by  the  reverse  transfor¬ 
mation 

F  =  0(1  -  P0)'1  *  (2.2b) 


and  vice  versa.  W>>  shall  view  Q(s)  as  a  representa¬ 
tion*  of  the  feedback  F(S) ,  which  is  useful  for 
design  purposes.  In  terms  of  Q,  (2.1)  assumes  the 
form 


[K  .  ]  = 
>  J 


(1  -  QP)  I’(l  -  QP) 
Q  (1  -  PQ) 


(2.  3) 


If  the  matrix  (2.3)  is  stable,  then  Q  is  stable. 
Conversely,  any  choice  of  stable  Q  makes  (2. 3) stable 
provided  that  W  satisfies  the  following  constraints. 
Cl :  PQ  and  (1  -  PQ)  P  arc  both  stable. 

By  using  Q,  subject  to  these  constraints,  as  a  de¬ 
sign  variable  instead  of  F,  we  replace  a  potentially 
unstable  design  variable  by  a  stable  one  and,  more 
importantly,  automatically  ensure  closed-loop  stabi- 
1 ity ,  without  further  recourse  to  stability  tests. 


If  P  is  stable  to  begin  with  then  Cl  is  satis¬ 
fied  automatically .  Otherwise,  cl_  is  a  condition  for 
Q  to  represent  a  stabilizing  feedback  for  p.  For 
stable  n,  ci_  is  equivalent  to  the  condition  that 
C2 :  At  any  pole  a  of  P(s)  of  order  m  in  Re(s)  2  0, 

(1  -  P(s)Q(s))  must  have  m  or  more  zeros. 


That  CJ_  implies  C2_  is  clear.  Conversely  sup¬ 
pose  that  C2  is  true.  Then,  as  Q  is  bounded  in 
Ko(s)  >0,  neither  (1  -  PQ)r  nor  (1  -  PQ)  can  have 
any  poles  in  Re(s)  *0,  except  possibly  at  the  poles 
of  p,  whore  in  fact  both  arc  now  finite.  Both  (l-PQ)P 
and  (1  -  PQ)  must  therefore  be  bounded  and  analytic 
in  Re(s)  >0,  and  therefore  stable,  i.e.,  Cl_  is  true. 

Observe  that  under  C£,  Q  has  precisely  m  zeros 

at  a . 


.!  B .  Be  ha  v  i  or  near  •*>;  Weightings 

Some  notation  for  limiting  the  growth  or  de¬ 
cay  of  frequency  responses  as  s  -►  will  bo  adopted. 

Notation  A  frequency  response  G  is  of  i nforior  order 
0(-fc)  at  ”,  Z  *  0,  iff  for  some  R  >  0, 

|G(s) I  >  const  J  s  J  1  (2.4) 

for  | s |  >  R,  Re(s)  *  0.  The  statement  that^G  is  in 
(1  +  s)  II*  means  that  (1  *  s)’^G(s)  is  in  H  . 

2C  •  The  Problem 

The  main  objective  of  this  paper  is  to  solve 
the  following  problems;  Suppose  that  the  plant  P(s) 
and  a  weighting  W(s)  are  fixed,  proper  frequency 
responses,  not  identically  equal  to  zero,  and 

1)  p(:»)  is  the  sum  nf  an  H  function  continuous 

•Observe  that  as  l’(s)  i  proper,  (2 ,2a-b)  map  any 
strictly  proper  F(s)  into  a  strictly  proper  Q(s), 
and  vice  versa. 


» 
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in  Re  (s)  J  0 ,  and  a  proper  rational  function;  P(s) 
has  q  poles,  a^^.a^,  and  r  zeros,  in 

Re ( s)  >0,  (q,  r,  finite),  each  enumerated  accord¬ 
ing  to  its  multiplicity;  no  poles  or  zeros  on  the 
)u>-axis;  and  P(s)  has  inferior  order  0(-k)  at  00 . 

2)  w(s)  is  in  H  ,  is  continuous  in  Re(s)  i  0, 
has  norm  llwfl  (  1,  has  no  zeros  in  Re(s)  Jr  0,  and  is 
of  inferior  order  0(-i.)  at  «. 

Problem  1 :  Find  a  strictly  proper  feedback  frequen- 
cy  response  F(s)  to  minimize 

II W  ( 1  +  PF)-1||  (2.5) 

subject  to  the  constraint  that  the  CL  matrix  (2.1) 
is  stable.  As  the  value  of  (2.5)  is  identical  to 
that  of 

II W(1  -  PQlll  (2.6) 

an  equivalent  problem,  which  is  easier  to  solve  be¬ 
cause  (2.6)  is  affine  in  Q  whereas  (2.5)  is  non¬ 
linear  in  F,  is;  Problem  2.  Find  a  stable  strictly 
proper  Q  in  H0  to  minimize  (2.6),  subject  to  the 
constraint  that  Q  stabilize  the  CL  system,  i.e„, 
that  conditions  Cl  or  C2  hold. 


The  expression  (2. 5-2.6)  will  be  called  the 
sensitivity.  The  optimal  sensitivity,  denoted  by 
p (P),  is  defined  by  either  one  of  the  equivalent 
expressions 

U(P)  =  inf  II W ( X  +  PF) _1ll  (2.7) 

F 

=  inf  II W(1  -  PQ)II  (2.8) 

Q 

where  the  infima  are  taken  over  all  strictly  proper 
frequency  responses  F,  or  over  all  Q  in  Hq  satisfy¬ 
ing  the  constraints  Cl^  or  C2. 

Remarks 

w  generates  an  auxiliary  norm  II  *11  ^  on  the  alge¬ 
bra  of  stable  frequency  responses,  defined  by 
I)gJIw  4  JJwgJI,  and  our  object  therefore  is  to  minimize 
this  auxiliary  norm  of  the  sensitivity  function, 

11(1  +  PD  " w*  The  auxiliary  norm  has  the  "multi- 
pi  icative'1  properties, 

II  g^  f  IlGjfl  IIg21Iw>  IIg1g2IIw  j  IIGjIIJIg^I  (2.9) 

It  was  pointed  out  in  (1)  that  inequalities  (2.9) 
are  important  in  the  study  of  plant  perturbations  or 
plant  uncertainty,  but  are  not  valid  for  the  quad¬ 
ratic  norms  used  in  LQG  methods.  One  of  our  main 
reasons  for  pursuing  the  presented  weighted-H 
approach  is  that  it  can  be  extended  to  problems  of 
plant  uncertainty,  along  the  lines  of  [1J,  although 
plant  uncertainty  will  not  be  considered  here. 

2E.  Relaxation  of  Propriety 

The  optimization  problem  is  simplified  if  the 
propriety  constraints  on  Q  and  the  other  CL  responses 
of  the  matrix  (2.2)  are  relaxed.  It  will  be  conven¬ 
ient  initially  to  solve  the  simpler  problem  (in 
Sect.  4),  and  later  in  (Sec.  5)  to  modify  the  im¬ 
proper  solution  obtained  for  Q,  by  inserting  some 
high  frequency  attenuation,  and  so  obtain  a  family 
of  strictly  proper  solutions  to  the  original  problem. 
The  relaxed  problem  is  defined  as  follows. 


B 

Let  H  denote  the  algebra  of  functions  G(s) 
which  are  analytic  in  Re(s)  >0  and  bounded  on  every 
half-disc,  jsj  <  R,  Re(s)  >  0,  of  finite  radius  R>0. 

Problem  2°:  rind  a  (not  necessarily  proper)  Q  in 
H  to  minimize  (2.6)  subject  to  the  constraint  that 

C3  (1  -  PQ)P  is  In  hB.  (Note:Q  €  H8,  (1  -  PQ)£  H8 

—  pq£hb.) 

This  problem  is  divorced  from  any  consideration 

of  F. 


III.  ALLOWABLE  SENSITIVITIES  AND 

FEEDBACKS:  INTERPOLATION  CONSTRAINTS 

The  optimization  problems  2  and  2°  will  be 
transformed  here  into  optimal  interpolation  problems. 
In  partucular,  the  constraints  C2-C3  on  Q  will  be 
expressed  as  interpolation  constraints. 

Suppose  that  P  and  W  are  fixed  as  in  Sect.  2C 
and  Q(s)  is  in  HB.  We  shall  wish  to  characterize 

those  Q(s)  for  which  P(s)Q(s)  is  in  HB,  and  observe 
that  there  are  three  equivalent  ways  of  character¬ 
izing  Q(s)  : 

1)  Q(s)  has  zeros  at  the  poles  of  P(s)  in 
Re(s)  >  0,  taking  into  account  their  multiplicities; 

2)  Q(s)  satisfies  the  interpolation  constraints 

Q(s>  =  0,  dQ/ds  -  0 .  (d/ds)m~^Q.O  (3.1) 

at  each  distinct  pole  of  P(s)  j£  multiplicity  m  in 
Re  (s)  >  0; 

3)  the  ratio  Q(s)/Bp(s)  is  in  H®,  where 

q  a.^  -  s 

B  (S)  =  „  (— ~)  (3.2) 

p  i=l  ai  s 

(recall  that  a,,..., a  are  the  Re(s)  >0  poles  of 
P(s) .)  1  q 

A  rational  function  of  the  form  (3.2)  is  called 
a  Blaschke  product.  In  Engineering  terminology,  a 
frequency  response  is  called  allpass  if  its  magni¬ 
tude  is  constant  a.e.  on  the  jw-axis.  As  B  (jw)=  1, 
Blaschke  products  are  allpass.  p 

Constraints  induced  by  RHP  Pi  amt  Poles 

A  (possibly  improper)  frequency  response  Q(s) 
in  Hb  satisfies  the  condition  C3^  for  the  CL  system 
to  be  in  HB  iff  (1  -  P(s)Q(s)J  has  zeros  at  the  RHP 
poles  of  P(s) ,  taking  into  account  their  multiplici¬ 
ties.  There  are,  again,  several  equivalent  ways  of 
expressing  C3,  namely 

1)  Q  satisfies  the  interpolation  constraints 

(1  -  P(a)Q(a)J  =  0 

j  (3.3a) 

(d/ds)m  ^(1  -  PQ)  =  0  at  s  *  a 

at  each  RHP  pole  a  of  P(s)  of  multiplicity  m. 

2)  A  compact  way  of  expressing  (3.3a)  is  to 
say  that  Bp  divides  (1  -  PQ)  in  HB,  i.e., 

(1  -  PQlB^e  H8  (3.3b) 
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or  that 

3)  W(1  -  PQJb”1^  HB  (3.3c) 

(3.3c)  is  equivalent  to  (3.3b)  because  W  and  W  *  are 
in  hb,  the  latter  because  W(s)  has  no  zeros  in 
Re  (s)  *  0. 


The  conditions  for  stable  Q(s)  to  stabilize 
P(s),  namely  C1-C2,  are  equivalent  to  the  require' 
ment  that  the  expression  in  (3.3b)  be  in  H®  or  that 
(3.3a)  hold. 

3A1 .  Prop  Suppose  there  exists  a  function  Qq (s)  in 
H0  which  stabilizes  P(s);  then 

a)  Q^ts)  can  be  expressed  in  the  form  Q0 (s)  = 

Q  m(s)B  (s) ,  where  Q  is  in  H  ; 
oA  p  oA  _ o 

b)  The  functions  Q  in  H  which  stabilize  P(s)  are 
those  and  only  those  which  have  the  form 

«  ■  VoA  +  BpC1  (3'41 

where  is  in  H®. 

c)  More  generally,  the  functions  Q(s)  in  Ha  for  which 

(1  -  PQ )  1  is  in  (or  H°°)  are  those  and  only 

those  which  can  be  expressed  in  the  form  (3.4),  with 
Q^ts)  in  Hb  (and  such  that  P(s)Q(s)  is  proper). 

Remark  There  is  a  connection  between  Proposition 
3A1  and  Theorem  3  of  (7J,  which  is  elaborated  in  the 
original  report. 

3B .  Constraints  Added  by  RHP  Plant  Zeros 

We  are  interested  in  optimizing  the  weighted 
sensitivity  function  W(1  -  PQ)  which  will  be  denoted 
by  X  and  is  related  to  Q  by  the  equations 

X  =  W ( 1  -  PQ)  (3.5) 

2  -  P'1d  -  W_1  X)  (3.6) 

Let  ft  denote  the  set  of  those  Q(s)  in  HB  that 
satisfy  the  pole  constraints  (3.3)  and  for  which  WPQ 
is  proper.  If  Q  is  in  ft  and  X  is  given  by  (3.5), 
then  it  follows  from  (3.3)  and  the  propriety  of  WPQ 
that  both  X  and  XB"1  are  in  H®.  From  (3.5)  it  is 
clear  tnat  X  must  satisfy  the  following  plant-zero 
interpolation  constraints; 


X  (b)  -  W  (b) 

(d/ds)™'1  X  *  (d/ds)m_1W  at  s  -  b 


(3.7) 


as  an  extreme  case  of  the  former. 

The  minimal  sensitivity  y(P)  can  be  interpreted 
as  a  measure  of  the  constrained  singularity  of  P. 


IV  IMPROPER  MINIMIZATION 

W«  turn  now  to  the  optimization  problem  2°,  of 
finding  the  minimum 

U(P)  =  inf  {INI:  X  =  W(1  -  PQ)}  (4.1) 


under  the  relaxed  constraint  that  Q(s)  is  a  possibly 
improper  function  in  HB  which  satisfies  the  plant- 
pole  constraints  (3.3).  (Equivalently,  Q  is  in  ft). 
Here,  P(s)  and  W(s)  are  the  fixed  functions  definec 
in  Sect.  II,  and  it  will  be  assumed  that  P(s)  is 
either  strictly  proper  or  has  at least  one  RHP  zero. 


X  can  be  minimized  first,  subject  to  (3.7),  and 
the  optimal  Q  calculated  second  by  (3.6).  Furthermore, 
as  lixll  =  !lXB“l||,  t^e  minimization  of  X  can  be  accom¬ 
plished  by  minimizing  XB”*-  and  multiplying  the  re¬ 
sult  by  B  . 

P 

Lemma  1  (a)  There  ls  a  unique  weighted  sensitivity 

function  X  W(1  -  PQ)  in  H  of  minimum  norm.  The 
necessary  and  sufficient  conditions  for  X  to  be  that 
function  are  that  X  satisfy  the  interpolation  con¬ 
straints  (3.7),  and  have  the  (allpass)  form 
m  c.  -  s  q  a.-s 
i(a)  =  D  n  <5—^1  n  (4.2) 

1*1  l  j-1  3 

in  which  Refc^  *  0,  D  is  a  constant  satisfying 
|P[  =  llXlt,  and  m=r-l.  — 

(b)  The  optimal  3  _i£  $  -  P-1(l  -  W~L  X),  and  belongs 
to  (1  +  s)k+iH". 

(c)  If  the  plant  P(s)  and  weighting  W(s)  have  con¬ 
jugate  symmetry,  then  so  do  X  and  Q,  and  the  co¬ 
efficients  in  (4.2)  are  real  or  occur  in  conjugate 
pairs. 


V.  MINIMIZATION  OVER  STRICTLY  PROPER  Q 

Suppose  that  P(s)  and  W(s)  are  fixed  as  in  Sect 
2C,  but  will  be  assumed  to  have  conjugate  symmetry. 
Q(s)  and  X(s)  are  the  extrema  specified  in  Lemma  1. 
and  Q  (s)  is  any  function  in  h®  representing  a 
strictly  proper  feedback  that  stabilizes  P(s).  Let 
^ 3 ^  *  n  “  1,2,...,  be  defined  in  one  of  two  ways: 


at  each  distinct  plant  zero  b  of  multiplicity  m. 

Remarks  on  Stabilization  and  Desensitization  vs. 
Inversion 

To  achieve  a  small  sensitivity  W(1  -  PQ) ,  PQ 
must  be  close  to  1,  i.e.,  Q  must  act  as  an  approxi¬ 
mate  inverse  of  P  subject  to  the  plant-pole  con¬ 
straints  (3.3).  Now  (3.3a)  means  that  only  those 
approximate  inverses  are  allowed  which  are  exact  at 
the  RHP  poles  of  P(s).  Equivalently,  sensitivity  is 
made  small  subject  to  the  constraint  that  it  is  zero 
at  the  plant  poles.  From  this  viewpoint,  the  prob¬ 
lems  of  sensitivity  reduction  and  stabilization  by 
feedback  are  related,  and  the  latter  can  be  viewed 


Case  1.  If  W(s)  is  strictly  proper, 

Qn(s>A  Q0(s)  +  (Q  -oj  Is)  In (s  +nf  1]k+l+1  (5.1) 

Case  2.  If  W(s)  is  proper,  and 

lim|s|..,  Re(s)  )0|M(S|  1  *  I-  then 

Xln(S)4X(s)[l  -  Bz(s)n^l,  1  .  — ">--W<-)  (5.2a) 

B z(-)X(») 

§ln<s>  Ap~1(s>ll  -  i1J(.)»-1(s)l  (5.2b) 


1 
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0n(s)4  50(a)  +  lCln<s)  I  lmn(s +mn)'1)k  +  l  (5.2c) 

where,  for  any  n  >  0,  >0  is  a  sufficiently  large 

integer. 

In  either  case,  the  corresponding  sequence  of 
feedbacks  is  given  by  the  fractional  transformation 

Fn(s)  -  0n(s>(l  -  PfsjjMs)]"1  (5.3) 

If  the  numerator  and  denominator  of  (5.3)  are  divid¬ 
ed  by  Bpl(s),  (5.3)  can  also  be  written  as 

F  (S)  =  6  (s)B'l(s)(|l  -  P(s)6  (S)  )  b”1  (3)  r1 
n  n  p  n  p 

(5.4) 


problem  falls  on  the  calculation  of  the  factor  of 
(4.2)  , 

r-1  c.  -  s 

X  ( 3 )  -  D  n  (=i— )  (5.10) 

B  (S)  i=l  Ci  3 

P 

whose  coefficients  are  not  known  priori. 

Assumption  (For  simplicity)  suppose  the  plant  zeros 
b^  are  distinct. 

By  Lemma  1,  XBp*  is  unique,  and  by  Prop.  3B1  is 
determined  by  the  interpolation  constraints  (3.7), 
which  take  the  form  of  r  equations 


Theorem  1  The  functions  Qn ( s )  defined  by  (5.1)  in 
Case  1,  or  (5.2)  in  Case  2,  are  (strictly  proper) in 
H  ,  stabilize  P(s) ,  and  are  optimal  in  the  sense  of 
producing  sensitivities  approaching  the  infimal 
value  u (P) *  i.e77 

lim  II W  ( 1  -  PQn)ll  =  y(P)  (5.5) 

n-**» 


X(b.)  .  r-1 

w4V  ' 


-  b. 

] )  =  e . 

c.  +  b  1  3 

1  1 


(5.11) 


where  0j  4  w (b j) /Bp (bj )  are  complex  constants  de¬ 
pending  on  the  values  of  the  weighting  W(s)  at  the 
RHP  plant  zeros,  and  the  location  of  the  plant  poles 
in  relation  to  these  zeros. 


Similarly  the  feedbacks  F  (s)  are  in  H°  and  ootimal. 

T7e . - -  -  0 - - - 

lim  I|W(1  +  F  P)  _1||  =  y(p)  (5.6) 

Moreover,  p(P)  -  f|  xll . 


5A.  Bounds  on  the  Minimal  Sensitivity 

5A1  Prop,  a)  y(P)  )  max  |w  (bi>  B~X  (bjL>  |  (5.7) 


>  max  |w(bi) | 
i 


(5.8) 


assuming  P  is  unstable. 


i . e . ,  Bp  $  1,  in  (5.8) . 


b)  If  P^  and  P2  are  plants  with  identical 
RHP  zeros,  PT  is  stable,  and  P^  has  poles  in  the  RHP, 
then  MP2)  >  u(PL)tf  ” 


Remark ;  It  follows  from  (5. 8-5. 7)  that  small  sensi¬ 
tivity  can  not  be  achieved  if  there  are  zeros  in  any 
heavily  weighted  part  of  the  RHP,  or  poles  anywhere 
near  these  zeros  (since  Bp(bj.)  is  smallest  near  the 
RHP  poles  of  P(s) ) .  Prop  5A1  shows  that  the  inser¬ 
tion  of  unstable  poles  in  Re(s)  >  0  into  an  other¬ 
wise  stable  system  always  deteriorates  the  achiev¬ 
able  optimal  sensitivity. 

5B  Determination  of  X  and  Q 

The  plant  P(s)  can  be  expressed  uniquely  as  the 
product 

P(s)  *  B^(s)B~1(s)P1(s) ,  (5.9) 

consisting  of  the  two  Blaschke  products  Bz(s)  and 
Bp ( s)  determined  by  the  Re(s)  >  0  plant  zeros  and 
poles  respectively,  and  a  factor  ?1(a)  which  is 
always  in  H*.  The  burden  of  the  optimization 


VI  EXPLICIT  FORMULAS;  FEW  RHP  ZEROS 

6A.  One  RHP  Plant-Zero  The  results  here  are  similar 
to  those  of  (1)  ,  and  are  omitted  from  the  summary. 


6B.  Two  rhp  Plant  Zeros 

(b.  -  s)  (b  -  s) 

in  this  case.  P(s)  =  (g___r  ^  (s) B-l(s) 

where  b^,  i  =  1,2,  are  real  or  occur  in  conjugate 
pairs,  and  Re(b^)  >0.  Here  r  =  2.  By  Lemma  1,  the 
minimal  weighted  sensitivity  function  X  must  have 
the  form 


X(s)Bp1(s)  =  D(c  -  s)  (c  +  s)"1 

and  the  interpolation  constraints  are 
(c  -  b^) 

=  1,2. 


(c  +  b^) 


-  V 


(6.1! 


(6.2) 


If  D  is  eliminated  from  the  pair  of  equations  (6.2), 
a  quadratic  equation  is  obtained  for  c,  only  one  of 
whose  solutions  lies  in  Re(s)  g 0,  namely. 


<b2  -  bli 


e2  -  ex 


(b2  ‘ 


If  D  is  expressed  in  terms  of  c  using 
(6.3),  there  results 


b2  bx  (6.3) 
(6.2)  and 
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from  which  it  follows  that  since  y(P)  =  |d|*  the 

minimal  sensitivity  is 


vj(P) 


iV^ii 


(6.5) 


(In  the  language  of  [1],  (6.5)  is  the  measure  of 
singularity  of  P.) 


It  can  be  deduced  from  (6.5),  after  some  mani¬ 
pulations,  that  m(P)  £  max ( J 0  | , | 0^)  in  conformity 
with  Prop.  LAl-a. 


that  assume  the  value  0  at  b,  and  the  set  of  all  pass 
functions  x^(s)  of  norm  |m|  (whose  value  at  b  is 
arbitrary) .  This  property  of  (7.2) ,  together  with 
the  result  of  Lemma  1  that  the  smallest  H°“  function 
satisfying  an  j interpolation  constraint  is  a  unique 
allpass  function,  i.e.,  a  Balschko  produce,  immedi¬ 
ately  yield  the  following. 


7.1  prop  If  the  smallest  H*  function  assuming  the 
values  (0.  ,...0^)  at  the  r  points*  (b^.-.b^)  is 
X(s)  and  nas  norm  |m|,  then  the  smallest  h°°  function 
assuming  the  transformed  values 
!Ui <e2) ,  U1(03),...  Ul(0r)J,  Ux4Uq  b  ,  at  the 

(r-1)  points  (b2»...b  )  also  has  norm  j M [  and  is, 
uniquely,  (U^X) (s) . 


If  we  assume  that 
improper  Q  is 


Vs) 


1,  then  the  optimal 


2  =  [1 


D  (c  -  S) 
W(s) (c  +  s) 1 


(b^  +  s)  (b2  +  s) 
<bx  -  s)  (b2  -  s) 


If  W(s)  is  the  "lowpass"  weighting  ic  (s  +  *0 
k  >  0,  then 


Qls)  -  D  (bi  +  s) (b2  +  s) 
<  (c  +  s) 


which  is  a  load- type  function  with  an  extra  "break". 


VII  EXPLICIT  FORMULAS:  MANY  RHP  ZEROS 

The  r  simultaneous  equations  (5.11)  for  the 
coefficients  c.  can  be  solved  by  a  method  based  on 

i 

the  Schur-Pick-Nevanlinna  Theory  (See  Walsh  (2,  Ch. 
l'.'J  for  an  exposition)  . 


Let  M  be  fixed,  and  for  any  | 6 |  <  M  let 
denote  the  mapping  x  b  \i  satisfying  the  equation 


u(s) 


m"  [ X  (  s )  -  0) 
M2  -  0  X  ( s ) 


(7.1) 


(7.1)  maps  the  (solid)  ball  of  radius  |m|  of  H  in¬ 
to  itself,  and  the  set  of  allpass  functions  of  norm 
[M|  into  itself.  Moreover,  if  x(s)  satisfies  an 
interpolation  constraint  x(b)  =  6,  then  u(s)  has  a 
zero  at  b. 


(7.1)  is  now  modified  by  division  by  (b-s) (b+s) 
thereby  removing  the  zero  at  b  without  increase  of 
norm,  to  obtain  the  transformation  b;H*  >  H°°, 
x  Xj  which  satisfies 


(s) 


M*~  [x  (s)  -  0) 
M2  -  0  x  ( s ) 


(b  »  s) 
(b  -  s) 


(7.2) 


(7.2)  still  m  s  the  |m|  -  ball  of  H  into  itself. 
It  also  estab  »hes  a  1:1  cor  rcs)*ondcnc«*  between 
the  set  of  all  allpass  functions  x(s)  of  norm  |m| 


Prop  7.1  offers  the  possibility  of  reducing  any 
H  minimization  problem  involving  r  constraints  to  a 
problem  involving  (r-1)  new  constraints.  If  this 
reduction  is  repeated  r  times,  all  constraints  are 
removed,  and  a  constant  of  norm  f M |  remains.  Let  us 
detail  this  procedure. 


Recall  that  XBp  is  the  function  of  smallest 
H  norm  assuming  the  constraint  values  0^,...0r  at 
the  points  bj,...br  in  Re(s)  >  0.  Let  M  be  a  com¬ 
plex  constant  of  magnitude  II  xll  ,  and  angle  to  be 
determined.  Consider  an  iteration  on  the  constraint 
set,  as  follows.  Denote  the  initial  constraints  by 
<x<°>>,  i.e., 


(o) 


(o)  . 


For  any  i  *  1,  if  the  (i  -  l)st  constraint  set  has 
the  form 

{xf1  **  #xi+i^  *  •  •  ‘X^  **},  let  the  i-th  constraint 
set  be 


(,<«  ,  x(i)  , . .  .x^ }  =  {T 

1  l+l  i+2'  r  J  lT 


(i)  (i-1)  m(i)_(i-l) 

i+l 


where  denotes  the  transformation 

(i)  M2[x(s)-x|i  1*1  (b.  *  s) 

[I  ’x)(sl  4  - r~ -  - - r- 

(bi  -  S) 


We  have  the  following  main  result 


Theorem  2  a)  M  satisfies  the  equation 


tT(r)T(r~1,...T<1)) (b  ) 


(7.3) 


(which,  after  evaluation,  yields  an  algebraic  equa¬ 
tion  in  the  single  variable  M) • 


b)  The  optimal  sensitivity  function  X  is 
determined  explicitly  by  the  equation 


X(s)  =  Bp(s)  [TU,r1[T<2V1...  (T1*1!*1  (M) 


•Where  Re  bi  >  0 
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where  [T^J  *  satisfies 

11  \  ,  M2{x(8)+x.(l  ^  ]  (b  -  s) 

{(TU,I  x}  (S)  1 


lM2  -  SL (i“l)x(s) ] (b  +  s) 


i  =  1, 2, . . . r . 

VIII  COMPARISON  WITH  WIENER-HOPF 


spond  to  disturbance  having  not  one  but  a  variety  of 
power  spectra.  Low  frequency  disturbances,  which 
must  be  attenuated,  occur  most  often.  However,  even 
very  high  frequency  disturbances  appear  occasionally, 
and  response  to  them  must  be  limited,  even  though 
attenuation  may  be  impractical.  This  situation  is 
tailor-made  for  an  H00  description. 

Suppose  that  disturbance  spectra  can  be  arbi¬ 
trary  subject  to  the  upper  bound  |d(  jui)  |  <_  |w(ju>)|, 
where 


QUADRATIC  MINIMIZATION 


The  well  known  Wiener-Hopf  quadratic  minimiza¬ 
tion  method,  e.g.  in  Youla  et  al.  [6],  provides  the 
optimal  feedback  for  disturbances  of  fixed  power 
spectrum. 

In  Fig.  1,  let  d  be  a  disturbance  obtained  by 
passing  unit  variance  white  noise  through  the 
filter  W(s).  The  mean-square  value  of  the  output  y 
is 

2“  | 2  dw  (8.1) 


W(s) 


0. lb  +  ;s  3 
0.1b  +  s 


5 


1/3 


The  weighting  here  is  substantial  for  frequencies 

smaller  than  b  ,  (|w(juj)|  >  0.707  for  |n|e  10,0.1b]) 

then  drops  off  quite  rapidly,  but  never  falls  below 
the  lower  bound  ^  =  1/20.  The  optimal  unweighted 
sensitivity  function  for  this  problem,  obtained  as 
in  Sect.  6A,  is 

(1  -  PQ)“  W(b)/W(s)=  0.077  (  -  $  )3 

0.1b  +  is 

and  is  illustrated  in  Fig.  3. 


and  the  Wiener-Hopf  objective  is  to  minimize  (8.1) 
subject  to  the  constraint  that  F  stabilizes  the  CL 
system. 

Introduce  the  Hardy  space  H2  of  functions  G(s) 
which  are  analytic  in  Re(s)  >0  and  for  which  the 
norm  is  IIgI!2  4  sup^^l/*  1^  dw]1/2.  This 

norm  can  be  '■.hown  to  be  computable  on  the  jw-axis, 
and  Mdf2  =  ( /°^jG(juj)|2 


The  Wiener-Hopf  problem  is  equivalent  to  the 
problem  of  minimizing  the  H2  norm  of  the  weighted 
sensitivity  function  X,  with  the  same  constraints 
as  in  Prop  3Bl,  and  can  be  solved  by  the  same  method 
as  the  H  problem.  The  result  is  that  there  is  a 
unique  weighted  sensitivity  function  X  in  H2  of 
minimum  norm,  which  is  characterized  by  the  formula* 


X 


(s) 


<Hs) 

nr  , ib.  *si 
1  =  1  1 


(8.2) 


where  <Ms)  is  a  polynomial  of  degree  <  r,  which  is 
uniquely  determined  by  the  plant-zero  interpolation 
constraints  (3.7).  $(s)  can  be  computed,  e.g.,  by 

the  Lagrange  interpolation  formula.  (Eq.  (8.2)  is 
equivalent  to  (41)  of  [6).) 


Although  there  is  no  completely  satisfactory 
description  of  this  problem  in  H2,  a  widely  used 
current  approach  is  to  design  X  as  though  |w(s) | 
were  a  fixed  poewr  spectrum.  Strictly  speaking  this 
is  possible  only  if  W(jui)  €h2,  i.e.,  if  £  =  0.  In 
that  case  r  =  1  in  (8.2),  £(s)  is  a  constant,  and  on 
matching  interpolation  constraints,  the  result 

(1  -  PQ)-[w(b)/fc(s>]^-  =  u  .  PQ)  2iL  (8.3) 

is  obtained.  The  unweighted  sensitivity  function  is 
greater  by  a  factor  of  nearly  two  over  all  frequen¬ 
cies  of  importance  than~in  the  h"  case.  If  £  ?  0, 
W(s)  can  be  approximated  in  a  distributional  sense 
by  a  sequence  of  H2  functions,  and  the  result  (8.3) 
and  conclusion  still  hold  in  the  limit,  as  shown  in 
Fig.  3. 

2 

Remark .  The  H  method  forces  the  integral-squared 
value  of  the  unweighted  sensitivity  function  to  be 
small  over  high  frequencies  (i.e.,  over  (b,<*»)) 
where  in  fact  the  integral  is  of  no  consequence  (in¬ 
deed,  in  practice  can  not  be  finite,  as  P$  is 
strictly  proper) ,  and  where  in  reality  it  is  enough 
to  maintain  an  upper- bound  on  that  unweighted  sensi¬ 
tivity  function.  This  gratuitous  reduction  comes 
at  the  expense  of  the  sensitivity  at  all  frequencies 
of  importance. 


8A.  An  Example 

Let  us  consider  a  representative  servo  problem 
involving  a  single  RHP  plant  zero,  and  compare  the 
H®  and  H2  solutions. 

As  RHP  plant  poles  affect  both  solutions  similar¬ 
ly  (cf.  (8.2)  and  (4.2)),  the  plant  will  be  assumed 
stable.  Suppose  that 

P(s)= (b-s) (b4s)  1P^(s),  b  >  0,  €  h" 

Typically,  servomechanisms  must  be  able  to  re- 


8B.  Conclusions 

If  the  disturbance  power-spectrum  exactly  equals 
|w(ju>)|,  quadratic  optimization  gives  the  best  sen¬ 
sitivity  function.  However,  this  sensitivity  func¬ 
tion  can  be  quite  poor  for  other  spectra,  even  with¬ 
in  the  passband  of  W(ju»).  For  example,  it  is  poor 
for  a  disturbance  whose  power  is  concentrated  in  a 
narrow  band  around  the  frequency  of  the  peak  in 
Fig.  3. 

•For  plants  with  conjugate  symmetry. 
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(1  -  P(jw»Q(jvO)) 


H  optimization,  on  the  other  hand,  gives  a 
sensitivity  function  optimized  for  a  set  of  dis¬ 
turbance  power -spectra,  namely,  those  bounded  by 
|w(jw)[  (but  arbitrary  otherwise). 

Of  course,  neither  the  H  nor  the  quadratic 
solutions  presented  here  consider  practical  issues 
of  bandwidth  cost,  power  cost,  etc.  However,  the 
H  formulation  appears  better  able  to  cope  with  those 
systems  that  are  required  to  tolerate  large  classes 
of  disturbance  power -spectra,  as  feedback  servo¬ 
mechanisms  are  (and  as  opposed,  say,  to  single¬ 
source  communications  systems) . 
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ABS:  The  transfer  function  matrices  K( lambda)  of  80A36644 

high-performance,  high-gain  error-actuated  controllers 
for  a  practically  important  class  of  controllable  and 
observable  linear  multivariable  plants  are  expressed 
directly  In  terms  of  properties  of  the  transfer 
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